Science for all


Many women are deterred from pursuing a career in science at the highest levels. Much more must 
be done to address the reasons behind this potential waste of human talent. 


Whether female scientists will want to celebrate International Women’s Day
on 8 March may depend on how far they look back in time. Things have changed, and
if you talk in terms of decades, there are considerable victories to cheer about.
But despite those victories, progress now seems to have stalled.

WOMEN IN SCIENCE
The gender gap and how to close it
nature.com/women

That is clear from the package of articles in this week’s Nature (see
page 21) that exposes the dismaying extent to which sexism still exists
in science. In the United States and Europe, around half of those who
gain doctoral degrees in science and engineering are female — but
barely one-fifth of full professors are women. Women are not invited in
significant numbers to sit on the scientific advisory boards of start-up
companies. A scientific conference at which half of the keynote speakers
are women stands out simply because of that.

Why has progress stalled? Childcare is one major factor that blocks
the career of many women. But that is a practical issue, theoretically
easy to fix given political will. Even the most enlightened childcare
policies will not fix a second, more insidious major problem: overt or
unconscious gender bias. It can be detected even in female scientists,
and even, according to neurobiologist Jennifer Raymond (see page 33),
in those who actively promote women within science.


POLITICS

The fate of women in science can be influenced for good and bad by
political systems. In communist China, women and men had a fairly
equal presence in science until recently, when its tottering attempts to
open up to the capitalist world led to a disproportionate recruitment
of young men returning from research training abroad. This sets the
stage for gender imbalance in the future. At the other end of the political
spectrum, Portugal's twentieth-century dictatorship similarly managed
to secure a healthy balance of female professors. That may have been
for the wrong reasons, such as that those jobs were of low salary and
prestige, or because a bellicose foreign policy sent male graduates to
fight in colonial wars. Most of those men chose to stay abroad, leaving
the academic field open for women. The role models were therefore in
place when Portugal became a democracy in 1974 and began to invest
in research in the 1990s. A healthy gender balance continues there.

The worrying gender bias in mature democracies won't be resolved
by the flick of a master switch. As in most professions, it is locked in
place by male dominance at all the levels of decision-making that affect
academic careers — from journal editorial boards, to grant-reviewing
boards, to academic selection committees. Women are barely visible at
these levels, fixing the subconscious idea that science belongs to men.
There are many ways to chip away at this invisibility and they should all
be tried, with the results published so that others can learn from them.

One serious proposal is the imposition of quotas. In certain contexts,
such as academic promotions, this would be a good way to ensure that
young female scientists have female role models. Some argue that
setting a quota for women in leading academic positions such as
professorships will result in mediocre female candidates being promoted.
But there is a gap in reasoning here. Women and men are equally talented,
so if men occupy a large majority of high-level posts, there must be an
awful lot of mediocrity among their number. Is mediocrity more acceptable
in men? Quotas on decision-making committees, however, do come with the
inbuilt problem of overburdening the few women who already hold top
positions. The solution here would be to keep the quota realistically
low for now.


PROFILE RAISING 

Individuals — from Nobel prizewinners to lowly postdocs — have a 
part to play. Some laureates have contributed with foundations. The 
Rita Levi-Montalcini Foundation supports young women in Africa 
who wish to become scientists. The Christiane Nüsslein-Volhard
Foundation supports young female scientists with children. And other 
scientists engage in various consciousness-raising activities. 

At the institutional level, several European research agencies and 
institutions have special funding programmes for women, allowing 
them to establish their own labs. Last November, Nature made a public 
challenge to itself by reporting that only 14% of its reviewers and 19% of 
its invited Comment and World View authors were female (see Nature 
491, 495; 2012). We vowed to improve, and have asked our editors to try 
harder to engage with women. In time, we will make our progress public. 

One useful tool is the online platform AcademiaNet 
(http://academianet.info), created by the Stuttgart-based Robert Bosch 
Foundation in Germany in cooperation with Spektrum der Wissenschaft,
the German edition of Scientific American (which is owned by
the Nature Publishing Group). AcademiaNet gives a web presence to 
high-achieving female scientists, making them visible to conference- 
programme committees seeking female speakers, journalists seeking 
experts to quote, head-hunters seeking board members and the like. The 
network will become even more important as work to address gender 
imbalance accelerates. With successful women being both fewer and 
less likely to push themselves forward than their male counterparts, 
they can be hard to find for even the most enthusiastic gender-balancer. 

AcademiaNet was opened in 2010 by German chancellor Angela 
Merkel, a former physicist, who stressed how important it is to preserve 
half of a country’s scientific potential. She recalled her own difficulties
training with men who would rush impulsively to try to solve practical- 
class problems. Her style was to think and then try out, by which time 
the equipment was occupied or broken. Might an option for separate 
training in some areas be useful for women, she pondered? Merkel 
knows a thing or two about being a successful woman, so let’s add her 
idea to that list of things to try. It’s a long list. It's time to get started. 
What are you waiting for?
WORLD VIEW


A central agency is crucial
for disaster response


The United States must not follow through with its plans to scale back its
Federal Emergency Management Agency, argues Donald Moynihan.


Hurricane Sandy did more than rock the eastern coast of the
United States last year. It also damaged Mitt Romney’s chances in
the presidential election. Quotes from Republican primaries, where
Romney called for responsibility for disaster response to shift from the
federal government to state and local authorities, suddenly looked foolish
as those local authorities were quickly overwhelmed. Yet, even as
the aftermath of Sandy demonstrates the need for federal help, the Federal
Emergency Management Agency (FEMA) in New York is losing
US$1.3 billion — roughly 5% of its budget — in government cutbacks.
The cut to FEMA is a false economy. If we do not prepare for the
growing threats that FEMA deals with, we will pay more when disaster
strikes. Worse, there is a political effort under way to delegate its
responsibilities to regional and local authorities.

Discussions on FEMA largely fail to acknowledge the agency’s crucial
functions. Climate change, combined with more human development in
vulnerable areas, will lead to more Sandy-like events. The question
that many countries face is not whether they need a national agency to
manage crises, but how to run such an agency in an era of catastrophic risk.

FEMA provides a classic public good. It offers services that the market
will not and in which state and local governments under-invest. It makes
more sense, and is cheaper, to develop high-level risk expertise at the
national rather than local level. This was, in fact, the rationale for the
creation of FEMA in 1979. Governors were tired of dealing with a confusing
federal approach to disaster response, and convinced then-president Jimmy
Carter of the need for a single central agency.

A coordinated response is essential because
large-scale crises cut across governmental boundaries and can quickly
exceed the capacity of local responders. FEMA fills the unique role
of lead coordinator. Much of the public may think of it as a national
response agency, but this is not really the case. It is too small to do that,
and by law becomes involved only when local and state responders
become overwhelmed.

Crisis-response networks of different public, non-profit and private
organizations coordinate more effectively if responders speak the same
language and follow the same general principles. Only a national government
can enforce common standards, and so avert the confusion
when multiple approaches conflict in their attempts to manage hazards.
After the terrorist attacks of 11 September 2001, FEMA and the
Department of Homeland Security forced states and local governments
to adopt common preparation and response principles. This approach is
not perfect, but it establishes role expectations before a crisis and is
[...] Australia, the United Kingdom and China, have followed the US lead.

FEMA also helps states to recover, through measures that include
providing money for disaster relief. This is the part of FEMA’s function
that provides direct and tangible benefits to individuals and is
subject to close attention by elected officials. Studies from both the
United States and India suggest that incumbent politicians tend to
lose support after natural disasters, but that cash for relief efforts can
minimize the political damage. This provides a strong incentive for
governors and congressional delegations to seek emergency declarations
that trigger federal disaster relief, and for the president to provide
them. The result has been a marked rise in the number of presidential
emergency declarations made since the 1990s, including declarations
for events not traditionally seen as disasters.

Disaster relief has become politicized, and this drains attention
from FEMA’s other main function — to mitigate
and prepare for disasters. Such investments are
made in the absence of a particular threat, and so
seem abstract and of limited value to voters. But
estimates suggest that every dollar invested in
mitigation creates long-term savings of between
$4 and $15. We need FEMA to drive the revision
of building codes to make buildings more resilient
to extreme weather, and to encourage states
to build climate change into their hazard plans.

Although Congress was happy to find 
$60 billion for disaster relief in the wake of 
Sandy, it is unwilling to invest a similar amount 
to reduce the impact of the next superstorm.
Historically, such investments are made only when
FEMA takes the lead to set national standards 
and share costs. To transfer power from FEMA 
to local officials would neuter this ability. 

To prepare for disaster means building strong working relationships 
between the organizations that form the crisis-response network. 
Training and simulation exercises build trust, which is essential in 
an emergency. Yet cutting them is seen as an easy way to save money. 
Before Hurricane Katrina hit New Orleans, Louisiana, in 2005, a 
planned exercise to test hurricane responses was postponed, and a 
follow-up workshop was cancelled. Without such investments, we can 
expect more responses that look like those to Katrina — responders 
disagreeing about who is in charge, and failing to work together. 

The public has come to expect that a large-scale crisis will be met 
with a competent national response. Politicians can help if they enable 
FEMA to better manage long-term risk. They must, because, however 
much they want to, they cannot wish away the next Sandy.


Donald Moynihan is a professor at the La Follette School of Public 
Affairs at the University of Wisconsin-Madison. 
e-mail: dmoynihan@lafollette.wisc.edu 
RESEARCH HIGHLIGHTS

Selections from the scientific literature


How migraines
begin


Researchers have pinpointed
a brain stress signal that may
spark migraine pain.

Migraines are thought
to be caused by a wave of
cellular depolarization that
travels through the brain's
cortex. Turgay Dalkara and
his colleagues at the Hacettepe
University in Ankara induced
depolarizing waves in the
exposed brains of mice by
pricking the cortex with a
pin or by applying potassium
chloride to it.

The authors then
used molecular and
pharmacological tools to
document the cascade of
molecular events that led to
activation of the trigeminal
nerves, which innervate the
face and are implicated in
migraines.
Science 339, 1092-1095 (2013)


Fine anatomy of
earliest animals


Internal structures of embryonic jellyfish-like organisms have been
found in limestone that formed more than 530 million years ago.

Basic animal body plans were established over half a billion years
ago, and their origins are often murky. Differing hypotheses have tied
one relatively common specimen, Olivooides, to three disparate phyla,
of which penis worms, sea stars and jellyfish are modern examples.
A team led by Philip Donoghue at the University of Bristol, UK, found
three specimens in which the internal anatomy (pictured) of late-stage
Olivooides embryos had been preserved, revealing details that place it
with Cnidaria, which includes jellyfish. The authors argue that
assigning ancient organisms to the correct group is crucial to our
understanding of how changes in embryonic development led to the
formation of current animal phyla.
Proc. R. Soc. B 280, 20130071 (2013)


Fish migration reduces predation


Fish in Danish lakes substantially reduce their risk of being eaten by
making annual migrations. Thus, predation — alongside food and
weather — may drive such movements.

Christian Skov at the Technical University of Denmark in Silkeborg and
his team implanted 2,219 roach (Rutilus rutilus, pictured) with tags
during a 4-year period, and monitored individuals’ migratory behaviour.
Most of the tags recovered from a nearby colony of cormorants
(Phalacrocorax carbo) were from fish that were last recorded as being
in lakes. Roach that spent most time in the lakes during winter were
substantially more likely to be eaten by cormorants than those that
moved to streams.

This study provides direct evidence that
animals can benefit from migration by
reducing their risk from predators.

Biol. Lett. 9, 20121178 (2013)
Worm signal for
river blindness 


A newly identified chemical 
could help to monitor the 
treatment of a tropical disease 
that afflicts tens of millions of 
people. 

Onchocerciasis, commonly 
known as river blindness, is 
caused by the parasitic worm 
Onchocerca volvulus. The 
worm can persist in the body in 
nodules, even after treatment. 

Kim Janda at the Scripps 
Research Institute in La Jolla, 
California, and his colleagues 
studied urine from people 
with the disease, aiming 
to find a marker of active
infection. The team found a
previously unknown molecule
called N-acetyltyramine-O,β-glucuronide
by using liquid chromatography–mass
spectrometry. The molecule —
which is derived from one of 
the worm’s neurotransmitters 
— was present at high levels in 
the urine of infected people, but 
at much lower levels in healthy 
people and patients who were 
receiving antibiotic treatment. 
The authors suggest that a 
similar approach could be used 
to find biomarkers for other 
tropical diseases. 
Proc. Natl Acad. Sci. USA
http://dx.doi.org/10.1073/pnas.1221969110 (2013)




CHEMICAL BIOLOGY 


Protein caught in 
the act 


A technique that can track 
proteins at atomic resolution 
in live cells could help to show 
how proteins fold and mature 
into functional forms. 

A protein's environment 
strongly influences its 
maturation, but most analyses 
of protein dynamics require 
purified components. Radu 
Aricescu of the University of 
Oxford, UK, Lucia Banci of 
the University of Florence, 
Italy, and their groups used 
nuclear magnetic resonance 
spectroscopy to track the 
structure of the protein SOD1 
in human cells. 

The technique showed that 
another protein helps SOD1 to 
bind zinc and copper, and form 
internal disulphide bonds that 
aid stability. The method also 
showed that disulphide bonds 
can form even when SOD1 
has not bound copper, a result 
that has not been seen using 
purified proteins. 

Nature Chem. Biol.
http://dx.doi.org/10.1038/nchembio.1202 (2013)


Tying fluids in 
knots 


Vortex loops, of which smoke 
rings are a familiar example, 
occur when a fluid or gas 
spins in a tornado-like funnel 
that turns back on itself. The 
behaviour of ‘knots’ that can 
form in vortex loops is hard 
to study, but these important, 
complex objects have now been 
created experimentally. 

Dustin Kleckner and William
Irvine of the University of
Chicago in Illinois generated
vortex loops in water (pictured)
by using precisely shaped
plastic wings produced with
a three-dimensional printer.
They watched how these loops
change over time; flows in
vortices break and reconnect,
causing knots to unknot. The
evolution of knots is important
to our understanding of energy
transfer in solar plasmas,
electromagnetic fields and
turbulent fluids.

Nature Phys.
http://dx.doi.org/10.1038/nphys2560 (2013)
For a longer story on this research,
see go.nature.com/jjtthk


Before the 
bacterial cell wall 


Much of the machinery in cells 
is devoted to division, but in the 
earliest cells this process may 
have been governed by changes 
in the ratio of surface area to 
volume. 

Although all bacteria either 
have cell walls or are thought 
to have evolved from ancestors 
that did, Jeff Errington and 
his colleagues at Newcastle 
University, UK, found that 
excess production of cell 
membrane in mutants of the 
bacterium Bacillus subtilis 
drives them into a state without 
cell walls. Non-mutant bacteria 
forced into a shape with a 
greater surface-to-volume 
ratio also lost their cell walls. 

In both cases, cells divided 
through irregular bulges, rather 
than dedicated cell-division 
machinery. 

In addition to providing a 
model for early cells, this work 
could help to explain how 
some infectious bacteria resist 
antibiotics. 

Cell 152, 997-1007 (2013) 




COMMUNITY CHOICE


X-rays read beer’s bitter chemicals


HIGHLY READ
on http://onlinelibrary.wiley.com in January


A technique developed in the twentieth
century has helped researchers to
pin down the conformations of the
compounds that flavour one of the
world’s oldest beverages.


A team led by Werner Kaminsky at the University of 
Washington in Seattle used X-ray crystallography to study the 
bitter-tasting chemical humulone — a component of the beer 
additive hops — and its derivatives. 

The authors purified acids recovered during beer-making 
and grew them into salt crystals that were then analysed with 
X-rays. This showed the exact position of the chemicals’ 
molecular side chains, contradicting previous assumptions 
made from easier but less-rigorous techniques. 

Now the arrangements of the humulone compounds
are known, their purported health benefits can be better
explored, the authors suggest.


Angew. Chem. Int. Edn 52, 1553-1555 (2013) 


MEDICAL SCIENCES
Sleepless nights 
affect gene activity 


Sleep deprivation alters the 
expression of hundreds of 
genes, including some whose 
activity normally varies 
depending on the time of day. 

Derk-Jan Dijk and his team 
at the University of Surrey in 
Guildford, UK, allowed study 
participants to sleep for up to 
10 hours every night for one 
week and up to 6 hours a night 
another week. At the end of 
each week, all 26 participants 
were asked to stay awake for 
about 40 hours. 

Genes affected by sleep 
deprivation included 
those involved in DNA 
packaging, gene-expression 
regulation, metabolism, 
and inflammatory, immune 
and stress responses. The 
authors suggest that studying 
the effects of sleep on gene 
expression can help to 
show how sleep deficits are 
linked to problems, such as 
cognitive impairment and 
obesity. 
Proc. Natl Acad. Sci. USA
http://dx.doi.org/10.1073/pnas.1217154110 (2013)


Hot star with a
cool layer


The nearby star α Centauri A
has a relatively cool layer above
its visible surface and beneath
its superhot corona, one of
several characteristics it shares
with the Sun.
A team led by René Liseau
at Chalmers University of
Technology in Onsala, Sweden,
looked at α Centauri A in
far-infrared wavelengths with the
Herschel Space Observatory
and a ground-based telescope.
They compared the star’s
light with a model of the
stellar atmosphere to show
a minimum temperature of
3,920 kelvin just above the
surface — the first minimum
observed on a Sun-like star.
Knowledge that this cool
layer exists in other stars
could help astronomers to
understand how widespread
such stellar atmospheric
phenomena are.
Astron. Astrophys. 549, L7 (2013) 
SEVEN DAYS


POLICY 


US science cuts 

After US lawmakers failed to 
reach a budget compromise, 
federal science programmes 
took a 5% hit to their annual 
budgets on 1 March — part of 
across-the-board government 
cutbacks known as 
sequestration. National Science 
Foundation director Subra 
Suresh said that the agency 
would probably maintain 
funding for currently approved 
research grants, and eliminate 
about 1,000 new grants. At the 
National Institutes of Health, 
director Francis Collins said 
that the agency would probably 
not complete funding for some 
multi-year grants this year. 

See go.nature.com/8vdcht for more.


European advisers 
The president of the European
Commission, José Manuel
Barroso, announced on
27 February the establishment
of a council to advise him on
where research could support
social and economic growth
in the European Union (EU).
The Science and Technology
Advisory Council comprises 15
high-ranking scientists and will
meet several times a year under
the chairmanship of Barroso’s
chief scientific adviser, Anne
Glover, to promote the public
acceptance of science and
technology and to support
‘evidence-based’ EU
policy-making. Members include
Swiss molecular biologist Susan
Gasser, head of the Friedrich
Miescher Institute in Basel,
and neuroscientist Tamas
Freund, director of the Institute
of Experimental Medicine in
Budapest.


India budget

Indian scientists were disappointed for a second year by budget
allocations for science and technology in the government’s 2013-14
budget request. Nine research departments share some US$6.9 billion,
a mere 4% more than budgeted for 2012-13, and below the rate of
inflation. India’s prime minister, Manmohan Singh, had promised large
increases in science spending from 2012 to 2017, but these have not
yet materialized. C. N. R. Rao, who heads the prime minister’s
science advisory council, says that he is glad that science at
least escaped big cuts.


Polar bear stays on US endangered list

The US Court of Appeals for the District of
Columbia has upheld a 2008 decision to list the
polar bear as threatened under the Endangered
Species Act. The Fish and Wildlife Service's
2008 listing relied in part on climate-model
projections indicating that the animals’
summer sea-ice habitat may disappear in
the future, making this the first time that a
species has been listed primarily on the basis
of the threat of global warming. On 1 March,
the appeals court rejected a challenge filed by
the state of Alaska and others, ruling that the
underlying science is “adequately explained
and uncontested”.


Patent reform

The UK government has revealed plans to change national patent law to
ensure that clinical trials do not count as a breach of existing drug
patents. To obtain marketing approval, new drugs must be compared with
medicines already on the market, but such comparison trials could
infringe the patents of existing drugs. Under rules announced on
26 February, which are still subject to parliamentary approval,
clinical trials will be exempt from these patent restrictions. Other
countries, including the United States and Canada, already have such
exemptions in place. See go.nature.com/rpuunl for more.


Pipeline push

The US Department of State has identified no major concerns in its
draft environmental assessment of a controversial 1,400-kilometre
pipeline proposed to link oil sands in Alberta, Canada, to existing
pipelines in Nebraska. Opponents argue that the Keystone XL project
would encourage the development of carbon-intensive energy. But the
analysis, issued on 1 March, concludes that approval or denial of the
project is “unlikely to have a substantial impact” on the rate of
development of the oil sands. Many analysts expect the US government
to approve the pipeline, although it denied an initial proposal last
January.


RESEARCH

HIV cure


An infant born with HIV has
been “functionally cured”, a
multi-institute team reported
on 3 March at a conference
on retroviruses in Atlanta,
Georgia. The child received
a combination of antiviral
drugs within 30 hours of birth
and for the next 18 months, at
which point doctors lost track
of the infant for 10 months and
treatment ceased. Subsequent
blood tests found no sign of
the virus. Quick treatment, the
doctors believe, prevented the
formation of dormant infected
cells in which the virus can
persist. It is the first time that
a cure has been reported in a
child with HIV.


Fukushima review 
The World Health
Organization released on
28 February its report on the
health impacts of the 2011
catastrophe at the Fukushima
Daiichi nuclear power plant in
Japan. The report concludes
that the accident will have
little health impact apart from
small, but significant, increases
in cancers among populations
in a few hotspots exposed to
higher doses of radioactivity.
But the health consequences
would have been much worse
had prevailing winds during
the accident not blown most
of the massive release of
radioactive elements out to sea.
See go.nature.com/fkp9vb for more.


Obama’s new team

After weeks of speculation, on 4 March US President Barack Obama
nominated Ernest Moniz, a physicist at the Massachusetts Institute
of Technology in Cambridge, to succeed Steven Chu as secretary of
Energy (see Nature 494, 409-410; 2013). Obama also named Gina
McCarthy (pictured) to replace Lisa Jackson as head of the
Environmental Protection Agency. McCarthy currently leads the agency’s
office of air and radiation and previously served as commissioner of
the Connecticut Department of Environmental Protection, where she
worked on a regional greenhouse-gas trading initiative.


Nobel physicist dies

Donald Glaser, a Nobel prizewinning physicist and inventor of the
bubble chamber used to track elementary particles, died on
28 February, aged 86. Glaser won the 1960 Nobel Prize in Physics for
his realization — in 1952, aged just 25 — that a vat of superheated
liquid could detect electrically charged particles better than the
vapour-filled ‘cloud chambers’ in use at the time. He later switched
to molecular biology, then to studies of human vision. He also
co-founded Cetus in Berkeley, California, one of the first
biotechnology companies.


Drug-trial data

The Swiss drug giant Roche announced last week that it would give
researchers access to anonymized clinical-trial data. Requests for
data would need to be approved by an independent panel of experts, the
Basel-based company said. The move follows that made last month by
GlaxoSmithKline to make data from all its future trials publicly
accessible (see Nature 490, 322; 2012). The European Medicines Agency
intends from next year to make public all trial data submitted to it
by industry for drug approvals.


Arctic drilling pause

Royal Dutch Shell announced on 27 February that it is suspending its
Arctic drilling programme this year. Work commenced on two oil wells
in Alaska’s Beaufort and Chukchi seas less than a year ago, but one
drilling rig lost its moorings in July, and a second ran aground in
December, sparking a federal review of the programme. Shell says that
it plans to tow the rigs to Asia for maintenance and repairs as the
company prepares for future drilling efforts.


TREND WATCH

The number of non-human primates being imported into the United States
for research purposes has dropped sharply since the British Union for
the Abolition of Vivisection (BUAV) in London and People for the
Ethical Treatment of Animals (PETA) in Norfolk, Virginia, started
pressuring major airlines to stop the activity (see Nature
489, 344-345; 2012). All but a few have acquiesced, including
Air China and China Southern, which had been transporting
large numbers of animals.

US PRIMATE IMPORTS DECLINE
The number of non-human primates imported for research
has fallen by 36% between 2006 and 2012.
[Chart: imports from China and from elsewhere; annotated "BUAV relaunches airline primate campaign"]
10 MARCH
Comet C/2011 L4 (PANSTARRS), which is coming into view in the
Northern Hemisphere, brightens as it reaches the point in its orbit
closest to the Sun.
go.nature.com/hpf2yy


11-12 MARCH
At the Royal Society in London, researchers aim to set the agenda
for the next decade of extrasolar-planet science.
go.nature.com/a5cxah


Dragon docking 

An unmanned Dragon
spacecraft docked with the
International Space Station
on 3 March, arriving a day
late because of thruster
problems that were resolved.
Built by California-based
firm SpaceX, the space
freighter carries supplies,
equipment and scientific
experiments. After completing
its delivery — SpaceX’s
second — Dragon is scheduled
to return and splash into the
Pacific Ocean on 25 March,
bringing back more than
1,300 kilograms of spent
supplies and research samples.
See page 18 for more on
private spaceflight.


FACILITIES 


Museum fire 


A large fire on 4 March gutted 
much of an iconic science 
centre: the 12,000-square- 
metre City of Science 
exhibition centre and museum 
in Naples, Italy, which opened 
in 2001. No one was hurt — the 
museum was closed at the time 
— but media reports say that 
the fire destroyed four of the 
complex’s five buildings. City 
authorities had not ruled out 
arson as Nature went to press. 


NATURE.COM
For daily news updates see:
www.nature.com/news
A worker in Newport, Oregon, burns debris off a Japanese concrete dock that washed across the Pacific.


Tsunami triggers 
invasion concerns 


Biologists track species on flotsam from Japan to US shores. 


BY VIRGINIA GEWIN 


When a 165-tonne block of concrete
and steel crashed into the central
Oregon coast last June, Jessica
Miller was shocked to find that the structure —
a dock that had washed across the Pacific from
Misawa, Japan — was teeming with life after
15 months at sea. “It was surreal,” says Miller,
a marine ecologist at Oregon State University’s
Hatfield Marine Science Center in Newport.
She found tens of thousands of organisms on
the structure in layers up to 15 centimetres
thick, including brown algae, pink barnacles
and shrimp-like creatures called caprellids. In
December 2012, a second large Japanese dock 
landed in Washington state laden with spe- 
cies. Other debris, including derelict boats and 
buoys harbouring live organisms, continues to 
wash up on the shores of Oregon, Washington 
and, most recently, Hawaii. 

Almost two years after the Tohoku earthquake
triggered a devastating tsunami, North
American shores are awash in debris swept
from the Japanese coastline some 8,000
kilometres away. The flotsam constitutes
floating islands of species — some of which could
potentially be invasive. According to estimates
from the Japanese government, the tsunami
carried about 1.5 million tonnes of debris out to
sea. “We expect to see tsunami debris for years
to come,” says Peter Murphy of the National
Oceanic and Atmospheric Administration
(NOAA) in Silver Spring, Maryland, who is the
Alaska coordinator for NOAA’s marine debris
programme.

For more on the Tohoku tsunami, see the Nature special:

The tsunami has sparked several 
unanticipated experiments. Biologists have 
seized on the unprecedented opportunity to 
track a potential species invasion from the 
beginning, and oceanographers have used the 
debris sightings to refine their models of the 
effects of winds and currents on debris. Marine 
ecologists, meanwhile, are tracing fish migra- 
tions using radioisotopes released from the 
tsunami-triggered accidents at the Fukushima 
nuclear reactors. 

“The tsunami debris is an unparalleled 
scientific opportunity, specific in time and 
place,” says Susan Williams, a marine ecologist
at the University of California, Davis. 

The invasion biologists have perhaps had 
the most urgent call to action, because no one 
anticipated that coastal species would survive 
such a long journey across the open sea. “This 
is an event so rare we simply don’t expect to 
see it,” says James Carlton, an invasive-species 
expert from Williams College in Williams- 
town, Massachusetts, who has teamed up 
with Miller and other colleagues to study the 
arrivals. 

Researchers must first confirm that the 
debris they are studying really is from the tsu- 
nami. Water bottles with Japanese characters 
offer a clue. Registration numbers on boats 
can be traced back to those that were reported 
missing. But of the 1,500 or so reported items 
that have washed up in recent months, only 
21 have been confirmed by the Japanese con- 
sulate, according to Murphy. 

The arrival of potentially invasive species is 
not a new concern on the Pacific coast. Organisms
can be transported on transoceanic boats
or in their ballast water. But boats do not typi- 
cally recruit whole communities, and they 
move too fast between ports for many organ- 
isms to hang on. The resident coastal commu- 
nities transported on slow-moving tsunami 
debris therefore look very different — and 
can arrive along the whole North American 
coastline rather than just at heavily monitored 
ports. No invasions have yet been detected
from the tsunami debris, but it is unlikely
that Carlton, Miller and their team are aware
of every single landfall, so a potentially invasive
species could well have been missed.

TSUNAMI INVASION
The 2011 Tohoku tsunami carried an estimated
1.5 million tonnes of debris out to sea, including
four large concrete docks. Two have washed up
on North American shores, teeming with life.

The team’s next step is to determine which 
organisms survived the journey across the 
Pacific, and how. In the nine months since the 
first dock made landfall, the team has identi- 
fied roughly half of the 175 species found so far 
on all debris items. John Chapman, a marine 
biologist also at Hatfield Marine Science 
Center, will go to Misawa later this month to 
document which species reside on docks there 
in early spring, the season when the tsunami 
struck. 

The results so far suggest a real risk of inva- 
sion. For example, three of the best-known 
algal invaders have been among the debris, 
says Gayle Hansen, a marine-algae expert at 
Oregon State University who is based in New- 
port. She says that 75% of the 46 algal species 
she has collected from the debris so far have 
been reproductively active, dropping spores. 
That gives them a good chance of getting estab- 
lished and possibly displacing native species in 
the Pacific Northwest. 

State officials, who are usually first on the 
scene, are often quick to destroy any organisms
clinging to debris to avert potential invasions.
That can make it challenging for Carlton's 
team to get samples. But the problem is eas- 
ing, Carlton says, as more responders become 
aware of the need to sample the arrivals. He 
adds that there have been few reports of debris 
with living Japanese species from Alaska, Brit- 
ish Columbia and California, which could 
represent a lack of reporting, or an artefact of 
oceanography. 

The recent spike in debris landings is to be 
expected during winter and spring, owing to 
large-scale patterns of currents and winds. 
The path of debris flows north of Hawaii 
(see ‘Tsunami invasion’) and was predicted
by NOAA models, which have incorporated 
sightings from mariners. But Nikolai Maxi- 
menko, an oceanographer at the University of 
Hawaii in Honolulu, says that the exact landfall 
for the concrete docks was difficult to predict 
because they are buoyant enough that the 
effects of winds and currents are comparable 
in strength. 

Maximenko is also helping to sample the 
plume of Fukushima radioisotopes, which 
lags behind the debris and is slowly moving 
east. The radioisotopes are assisting ecolo- 
gists. In February, Nicholas Fisher, a marine 
scientist at Stony Brook University in New
York, and his colleagues showed that two
radioisotopes, caesium-134 and caesium-137,
could be used to trace the past movements 
of bluefin tuna between Japan and Califor- 
nia (D. J. Madigan et al. Environ. Sci. Technol. 
http://doi.org/kn9; 2013). Fisher’s team 
confirmed the presence of both caesium 
isotopes in bluefin tuna arriving from Japan 
— although the levels pose no health risk to 
humans. “We see evidence that fish might 
cross the Pacific in about one month, which 
we found amazing,” says Fisher. 

Other researchers from Oregon State Uni- 
versity in Corvallis and NOAA are using the 
technique to determine whether there are 
two different stocks of North Pacific alba- 
core tuna, which would be reflected in each 
population’s isotopic profile. Fisher’s team 
also wants to use this method to track the 
migratory patterns of other large marine ani- 
mals, such as albatrosses, loggerhead turtles 
and salmon sharks. 

Even as researchers track species using 
radioisotopes and count them on debris, they 
bear in mind the tsunami’s staggering human 
toll. “This is the experiment that never should 
have taken place,” says Chapman.


ESA’s climate-eye dilemma


Scientists face difficult choice for Europe’s next Earth-observation mission.


BY QUIRIN SCHIERMEIER 


Snow, trees or the air we breathe? Europe’s
environmental research community is
facing the difficult task of settling which
of the three should be the priority for Europe’s
next Earth-observing satellite.

Around 250 Earth scientists and climate
researchers will meet in Graz, Austria, this
week to weigh up the scientific benefits of
projects proposed for the roughly €300-million
(US$390-million) seventh Earth Explorer
mission of the European Space Agency
(ESA). They face a choice between three projects
— Biomass, PREMIER and CoReH2O
(an acronym referring to cold regions and
water) — preselected through peer review
from more than 20 proposals. The scientists
behind the missions have been preparing their
proposals for years.

The Biomass project aims to take radar
measurements of global forest biomass to
assess terrestrial carbon stocks and fluxes.
CoReH2O, also a radar mission, would measure
snow cover and snow-melt rates in cold regions
around the world. Finally, PREMIER would use
infrared and microwave sounding to measure
atmospheric composition and temperature in
the upper troposphere and lower stratosphere,
a region particularly important for climate.

No satellites currently in orbit can match the
sensitivity of the proposed missions. Neither
NASA’s Landsat programme, which has captured
images of Earth since 1972 (see Nature
494, 13-14; 2013), nor Japan’s Advanced Land
Observing Satellite-2 — to be launched later
this year — have sensors that can measure
forest biomass with the precision of Biomass, 
for example. And climate scientists have been 
warning of an impending data crisis after the 
2010 retirement of NASA’s ICESat mission, 
which among other things monitored ice 
sheets and vegetation, and the failure of the 
agency’s Orbiting Carbon Observatory and 
the solar monitoring and aerosol mission Glory 
(see Nature http://doi.org/bqjhn7; 2011). 

Once ESA has decided which it will back, a 
satellite could be ready for launch by the end of 
the decade. Science will be the main criterion 
for the decision, but cost and the maturity of the 
technology will also play a part. The stakes are 
high, says Thomas Stocker, a climate researcher 
at the University of Bern, and a co-chair of a 
working group of the Intergovernmental Panel 
on Climate Change. “Findings such as those 
concerning ice-sheet changes in Greenland and 
Antarctica would have been impossible without 
space observations,” he says.

Europe currently has three Earth Explorer 
satellites in orbit: GOCE is mapping tiny vari- 
ations in Earth’s gravity field, which can reveal 
changes in ice mass and ocean circulation; 
SMOS is measuring soil moisture and ocean 
salinity; and CryoSat-2 is monitoring varia- 
tions in sea-ice thickness and changes in the 
mass of large ice sheets and glaciers. Three 
further missions — Swarm, ADM-Aeolus 
and EarthCARE — are scheduled for launch 
over the next three years, and will, respectively, 
monitor Earth’s magnetic field; global wind 
profiles; and clouds, aerosols and radiation. 

Measurements from ESA’s SMOS mission show the thickness of seasonal sea ice.

The seventh Earth Explorer mission will
provide scientists with yet another set of global
data that are hard to come by on the ground. 
The extent of seasonal snow cover, for instance, 
is an important feedback in climate change 
because snow reflects sunlight, cooling Earth’s 
surface, and affects the supply of fresh water. 

“More than 1 billion people rely on glaciers 
and seasonal snow packs for their water supply,”
says Helmut Rott, a meteorologist at the
University of Innsbruck in Austria and lead
scientist on the CoReH2O project. “They will
want to know what the future might hold for 
them. That’s why we need these data.” 

But data on global forest biomass — a major
store of land carbon and a key indicator of
biodiversity — are no less important, says Shaun
Quegan, a carbon-cycle researcher at
the University of Sheffield, UK, who is
part of the Biomass assessment group. In
particular, he says, satellite observations
are needed to quantify
global carbon emissions for tropical forests, for
which no reliable ground inventories exist.

Supporters of the PREMIER mission are 
just as emphatic. It promises knowledge that 
“is absolutely vital”, says Michaela Hegglin, an
atmospheric scientist at the University of Reading,
UK, and a member of the mission’s advisory
group. To improve regional climate predictions, 
she says, “we need to better understand how 
the atmospheric circulation responds to rising 
greenhouse-gas concentrations”. 

No clear front runner has emerged. Which- 
ever proposal prevails at the Graz meeting is 
almost certain to be funded by ESA. A final 
decision is expected at the agency’s board meet- 
ing in May in Svalbard, Norway, but ESA has 
backed the verdict of the scientists in the past. 

“If I had my way, I would love to see all three
missions fly,” Stocker says. But with the economic
crisis casting a shadow over Europe’s
Earth- and climate-observation plans, there 
is little chance of that. Spain, France and Italy 
last year reduced their contributions to ESA, 
leaving the agency with €600 million less for 
its 2013-17 Earth-observation budget than 
it had hoped for. The seventh Earth Explorer 
mission is safe, says Volker Liebig, director 
of ESA’s Earth-observation programmes. But
tight budgets are likely to shrink the size and 
ambition of future missions. “We need to make 
future missions cheaper without impairing the 
science,” he says.
ASTRONOMY


The death of the 
Chebarkul meteor 


Scientists reconstruct a battered traveller’s final moments. 


BY QUIRIN SCHIERMEIER 


The city of Chelyabinsk was once a secret
Soviet weapons centre, then a poor
Siberian backwater. But a few minutes
after sunrise on 15 February, the largest meteor 
blast in more than 100 years lifted the region 
from obscurity. Since then, scientists have 
been scrutinizing fragments of the meteorite 
and studying videos of its final moments to pin 
down its origin and how it got to Earth. 

The picture so far is of a garden-variety 
envoy from the asteroid belt between Mars 
and Jupiter, the size of a house and weighing 
9,000 tonnes, that had a hard life even before its 
chance encounter with Earth. “It was a rather 
fragile body,” says Pavel Spurny, a meteor expert
at the Ondřejov Observatory near Prague, a
member of one of the teams doing the analysis. 

His team examined seven videos of the fire- 
ball — the largest since the 1908 Tunguska 
meteorite, which also hit Siberia. The analy- 
sis reveals that the meteor first became visible 
around 92 kilometres above ground. Just over 
11 seconds later, at a height of nearly 32 kil- 
ometres, it exploded spectacularly under 
the stress of heating and air drag, damaging 
thousands of homes and injuring more than 
1,000 people in and around Chelyabinsk. 

Before its cataclysmic encounter with Earth’s
atmosphere, the object — dubbed Chebarkul 
after the small town and lake where some of 
the largest fragments have been found — seems 
to have been on an elliptical orbit around the 
Sun. Stretched between Venus and the centre of 
the asteroid belt between Mars and Jupiter, the 
Earth-crossing orbit was just slightly inclined 
relative to the plane of Earth's orbit around the 
Sun, the team says in its report to the Interna- 
tional Astronomical Union’s Central Bureau 
for Astronomical Telegrams in Cambridge,
Massachusetts. (NASA scientists have ruled
out any relation between the Chebarkul meteor 
and the asteroid 2012 DA14, which skimmed 
by Earth just hours later.) 

A team assigned by the Russian Academy of 
Sciences to comb the snowy countryside has 
collected more than 50 fragments measuring 
0.5-1 centimetre in diameter. A second group, 
led by Viktor Grokhovsky of the Urals Federal 
University in Ekaterinburg, has found another 
50 or so pieces, one of which weighs almost 
2 kilograms. The samples are being sent to the 
Russian Academy of Sciences’ Vernadsky Institute
of Geochemistry and Analytical Chemistry
in Moscow, which has also asked locals for
photographs of any fragments they have found. 

Inside some of the rocky shards are glassy 
veins, perhaps created during the impact 
that broke the object away from its mother 
asteroid many million years ago. Preliminary 
geochemical analysis suggests that Chebarkul 
was a stony meteor with low iron content — a 
chondrite — made of material that had been 
partially melted and recrystallized from the 
dust and gas cloud of the early solar nebula. 

A sturdier body might have reached the 
surface without exploding, but this one prob- 
ably took a beating after its genesis, having 
collided with other celestial bodies, says 
Timothy Spahr, director of the Smithsonian 
Astrophysical Observatory’s Minor Planet 
Center in Cambridge, Massachusetts. “This 
could have caused cracks which then resulted
in such a powerful blast,” says Erik Galimov, 
director of the Vernadsky Institute. 

The search is on for bigger pieces that hold 
more clues to the meteor’s origin and history. 
Spurny and his colleagues have calculated the 
final part of the meteor’s 254-kilometre flight 
through the atmosphere and where the largest 
fragments probably landed (see ‘Impact zone’).

The fireball zipped through the upper 
atmosphere at an initial velocity of 17.5 kilo- 
metres per second, the team thinks. In the 
denser air near the ground, the fragments 
would have slowed to about 180 metres 
per second, cooled and vanished, says Jiri 
Borovicka, a co-author of the report. 

IMPACT ZONE
Calculations based on the path of the fireball suggest where
fragments of Russia’s 15 February meteorite may have landed.
Largest fragment may have splashed down in Lake Chebarkul.
Fragments weighing several kilograms may have landed here.
Small pieces may have showered a region located 5 kilometres
south of the path.

But calculations based on the fireball’s
observed path and on wind profiles suggest
that chunks weighing tens of kilograms prob- 
ably landed close to the village of Travniki, 
and one weighing around 1 kilogram may 
have hit northwest of the village of Shchapino, 
Borovicka says. Thousands of smaller pieces 
may be hiding in a 25-kilometre-long swathe 
south of the final point of the trajectory. 


The largest single fragment, a piece of rock 
that could weigh up to half a tonne, may have
landed smack in Lake Chebarkul. A 6-metre- 
wide hole found in the lake's frozen surface the 
morning after the impact “almost certainly” 
marks the spot where the whopper came 
down, says Borovicka. 

Russian military divers are busy searching 
the shallow lake. But Spahr, who discussed the
impact with other scientists at a meeting of
the United Nations Committee on the Peaceful
Uses of Outer Space in Vienna last month,
isn’t so sure. “From what we’ve seen on pictures,
the ‘crater’ just doesn’t look right,” he
says. “It looks more like a hole someone has
cut in the lake with an axe.”


Metal oxide chips show promise 


Materials that flip from insulator to conductor could make energy-efficient transistors. 


BY EUGENIE SAMUEL REICH 


The switches in most electronic circuits
are made of silicon, one of the commonest
elements. But their successors might
contain materials that, for now, are lab-grown 
oddities: strongly correlated metal oxides. 

The allure of these materials lies in the outer 
shells of electrons surrounding their metal 
atoms. The shells are incomplete, leaving 
the electrons free to participate in coordi- 
nated quantum-mechanical behaviour. In 
some materials, electrons pair up to produce 
superconductivity, or coordinate their spins 
to produce magnetism. Other materials can 
switch from being an insulator to a conductor. 

Unlike transitions to superconductiv- 
ity, which happen as temperatures approach 
absolute zero, the insulating-to-conducting 
transition typically happens as temperature 
increases, and sometimes occurs near room 
temperature. That has raised hopes that metal 
oxides could be used instead of silicon to make 
transistors. A spate of results is now making 
that look feasible. “People are interested in 
seeing if oxides can make it to applications,” 
says Manuel Bibes, a physicist at the Joint Phys- 
ics Unit in Palaiseau, France, which is run by 
the French National Research Centre and 
electronics company Thales. 

Metal oxide transistors have the potential 
to consume less power than silicon switches, 
because the phase transition frees electrons 
from their localized state near each atom, with- 
out moving them through the bulk material. 
By contrast, silicon switches work by pulling 
electrons through the material to a channel 
where they conduct current (see ‘Go with the 
flow’). 

GO WITH THE FLOW
Metal oxide transistors have the potential to
consume less power than silicon ones, because
switching does not require the atoms to be
relocated.

Silicon transistor
A positive charge at the gate draws electrons
into a surface channel, allowing current to flow
between the source and the drain.

Metal oxide transistor
An applied electric field at the gate causes
electrons on metal atoms to become conductive,
letting current flow between source and drain.

In the past 5-10 years, researchers have
succeeded in growing high-quality thin films
of the metal oxides — overcoming one of the
major barriers to applications. In July 2012, for
example, a group in Japan reported¹ that it had
deposited a thin film of vanadium dioxide that
underwent a phase transition in response to an
applied electric field — proof that the material
could be used as an electronic switch.

And last month, a group led by Shriram 
Ramanathan, a materials scientist at Harvard 
University in Cambridge, Massachusetts, 
addressed a fabrication challenge by growing 
a thin film of samarium nickelate on top of a
substrate made of silicon and silicon dioxide².

The nickelate was deposited at a relatively 
low temperature that did not disturb the 
underlying silicon layers, raising the possi- 
bility of manufacturing metal oxides on top 
of silicon wafers to form three-dimensional 
chips, says Andrew Millis, a solid-state theo- 
rist at Columbia University in New York. Not 
only would that allow computing power to be 
packed much more densely, says Millis, but it 
would also permit metal oxide switches to be 
built on top of existing circuit architectures. 

Other groups are trying to understand the 
nature of the phase transition. In January, Ivan
Schuller, a solid-state physicist at the University
of California, San Diego, and his colleagues
showed³ that in vanadium oxide, the transition
is in large part caused by micrometre-scale 
heating by the applied electric field. 

Some point to Schuller’s work as evidence 
that metal oxides will never make fast switches, 
because heating effects are usually quite slow. 
But Ramanathan says that his own measure- 
ments on vanadium oxide demonstrate that 
the phase transition is quite fast — less than 
a few nanoseconds — and that it should not 
hinder applications. 

Some physicists are finding further exam- 
ples of potentially useful materials. Bernhard 
Keimer at the Max Planck Institute for Solid 
State Research in Stuttgart, Germany, alternates 
thin layers of metal oxides to form composites 
that often turn out to have serendipitous prop- 
erties. His group layered conducting lanthanum 
nickelate and insulating lanthanum aluminate 
and found⁴ that the composite underwent a
transition between the two properties. 

The highest phase-transition temperature 
for the composite was 150 kelvin above abso- 
lute zero — too low for practical applications. 
But the group is now trying to replicate the 
phenomenon in other materials that might 
have higher transition temperatures. 

Sandip Tiwari, an applied physicist at Cornell 
University in Ithaca, New York, acknowledges 
that metal oxides are not yet close to competing 
with silicon. But given recent progress, he feels 
that researchers need to start trying to imple- 
ment them in devices. That way, he says, all the 
properties needed for a good transistor will be 
developed in tandem. “If you just look at whatever
property is your favourite, you won’t get
them all.”
A successful engine test last month means that the Antares rocket can proceed to its inaugural launch.


SPACE FLIGHT 


Commercial space 
race heats up 


Antares test could challenge dominance of Falcon 9 rocket. 


BY DEVIN POWELL 


The Falcon 9 rocket, which made its
fifth successful flight on 1 March, has
stolen the spotlight in the commercial
space race. Built by SpaceX, a young company
based in Hawthorne, California, the rocket has
become NASA’s choice for hauling cargo to the
International Space Station (ISS). But it may 
soon have competition from a rocket that has 
kept a low profile (see ‘Battle of the rockets’). 

After years of delays, Orbital Sciences of 
Dulles, Virginia, has slated the first test flight 
of its Antares rocket for April. If that goes well,
its second mission could carry an unmanned 
Cygnus spacecraft to the ISS within months. 
“There’s no one main problem, no show- 
stopper,” says Orbital spokesman Barron 
Beneski. “In hindsight, this has just taken us 
longer to do than we thought it would.”

Both companies have received hundreds 
of millions of dollars from NASA’s Commer- 
cial Orbital Transportation Services (COTS) 
programme. With the space shuttle retiring in 
2011, the agency wanted alternatives to paying 
for ISS deliveries aboard the Russian Progress 
and Soyuz craft. NASA deliberately put two 
companies in competition with each other 
to keep prices down over the long run and to 
attract other customers. “The government is 
the necessary anchor tenant for commercial 
cargo, but it’s not sufficient to build a new 
economic ecosystem,” says Scott Hubbard, an 
aeronautics researcher at Stanford University in 
California and former director of NASA’s Ames
Research Center in Moffett Field, California. 

BATTLE OF THE ROCKETS
Next month, Orbital Sciences plans to launch
its Antares rocket, which could provide an
alternative to SpaceX’s Falcon 9 for supplying
the International Space Station.

With 30 years of experience in making satellites
and rockets, Orbital once seemed the safer
bet. Instead of assembling its vehicles from
scratch like SpaceX, Orbital uses parts made
by companies with proven track records. The
core of the first stage of Antares was designed
and built by veterans KB Yuzhnoye and
Yuzhmash, both based in Dnipropetrovsk, 
Ukraine. Cygnus’s sensors come from Mit- 
subishi Electric in Tokyo and its pressurized 
cargo module was built at a Thales Alenia 
Space plant in Turin, Italy. “Orbital used more 
heritage technology,” says Alan Lindenmoyer, 
manager of NASA’s commercial crew and 
cargo programme. “That was less risky for us.” 

But the company did not enter COTS until 
2008, two years after SpaceX. With the clock 
ticking, NASA allocated less money for Orbital 
and ordered a simpler ship. Unlike SpaceX’s 
Dragon capsule, Cygnus can't carry sensitive 
biological experiments, such as those that grow 
protein crystals in microgravity. It burns up 
on re-entry, so it can't return samples to Earth. 
And it can’t be modified to carry humans. 

Nor has it yet flown. Orbital chose to launch 
from NASA’s Wallops Flight Facility in Virginia;
less crowded than Cape Canaveral in Florida, 
which hosts most NASA rocket launches, Wal- 
lops usually caters for smaller vehicles such as 
scientific balloons and sounding rockets. The 
facility’s Mid-Atlantic Regional Spaceport had 
to build a new launch pad for Antares, which 
took longer than expected. Originally scheduled 
for 2010, the demonstration launch slipped to 
2012, and then to 2013, after Hurricane Sandy 
hit the spaceport last October. 

Antares’ engines, built half a century ago 
for Russia's Moon programme and recently 
refurbished, have also proven finicky. A test 
on 13 February was aborted when pressure 
anomalies were detected in one of the engines. 
A successful test on 22 February means that 
Orbital can now proceed to a launch in April. 

Having another cargo carrier would provide 
a safety net for the ISS. That need was made 
clear when Dragon’s thrusters initially failed 
during the flight last week, almost preventing 
the spacecraft from docking with the station. 
But there is scepticism about whether competi- 
tion will drive down prices. “This is a mixed- 
up crazy business and a small market that isn’t 
all that price sensitive,” says Henry Hertzfeld, 
a space-policy expert at George Washington 
University in Washington DC. 

Factors such as political or military concerns 
often shape demand, he says. And reliability 
is as much of a concern as price. The market
could grow: private space stations, orbiting fuel 
depots and asteroid-mining operations have all 
been proposed as future clients for commer- 
cial rockets. But for now, supplying the ISS and 
launching satellites are the major niches. 

SpaceX is making inroads with the latter: 
it has contracts for launches with the US Air 
Force and commercial operators in Israel, Thai- 
land, Luxembourg and Mexico. Orbital, too, 
wants to develop business beyond NASA, but 
it has not yet booked any launches. “Right now, 
we’re focused on getting the rocket launched,”
says Frank Culbertson, executive vice-president
of Orbital’s advanced-programmes group.
“Then we can talk to other customers.”
Behind the scenes of a
brain-mapping moon shot 


Critics fear that a proposed megaproject could crowd out other biological research. 


BY MEREDITH WADMAN 


Ron Kalil, a neuroscientist at the
University of Wisconsin-Madison,
didn’t expect to see his son among the
28,500 attendees at the meeting of the Society 
for Neuroscience in New Orleans last Octo- 
ber. And he wondered why Tom Kalil, deputy 
director for policy at the White House’s Office 
of Science and Technology Policy (OSTP), 
was accompanied by Miyoung Chun, vice- 
president of science programmes at the Kavli 
Foundation in Oxnard, California. 

Tom Kalil told his father that the Kavli 
Foundation had wanted his help in bringing 
nanoscientists together behind an ambitious 
idea. Ron Kalil says he thought: “Why are you 
talking about it at a neuroscience meeting?” 

He understands now. These two people, 
neither of them a working scientist, had 
been quietly pushing into existence the Brain 
Activity Map (BAM), the largest and most 
ambitious effort in fundamental biology since 
the Human Genome Project — and one that 
would need advances in both nanoscience and 
neuroscience to achieve its goals. 

This is the kind of science — big and bold — 
that politicians like. President Barack Obama 
praised brain mapping in his State of the Union 
address on 12 February. Soon after, Francis 
Collins, director of the US National Institutes 
of Health (NIH) in Bethesda, Maryland, which 
will be the lead agency on the project, talked up 
the idea in a television appearance. 

The Obama administration is expected 
to provide more details about the initiative 
this month, possibly in conjunction with the 
release of the federal 2014 budget request. But 
already, some scientists are wondering whether 
the project, a concept less than two years old 
and still evolving, can win new funding from 
Congress, or whether it would crowd out pro- 
jects pitched by individual scientists. “Creative 
science is bottom-up, not top-down,” says Cori
Bargmann, a neurobiologist at the Rockefeller 
University in New York. “Are we talking about 
central planning inside the Beltway?” 

The idea was born at a meeting of neurosci- 
entists and nanoscientists that Chun helped to 
convene in September 2011 at the Kavli Royal 
Society International Centre at Chicheley Hall, 
an elegant country house in Buckinghamshire, 
UK. There, a handful of scientists drafted
a white paper laying out their ambitions to
illuminate what they would soon call the 
“impenetrable jungles” of brain function, by 
mapping and stimulating neural circuits with 
cellular and millisecond-level resolution. 
Functional magnetic resonance imaging can 
provide the big picture, and electrodes can 
trigger single and small groups of neurons, 
but this project called for the mapping of thou- 
sands of neurons simultaneously, using new 
tools such as nanoprobes and optogenetics, 
which manipulates neurons using light. Chun 
suggested sending the paper, which eventually
appeared in Neuron (A. P. Alivisatos et al.
Neuron 74, 970–974; 2012), to the OSTP.

Chun, a molecular geneticist known for 
being extraordinarily persistent, had already 
been in touch with Tom Kalil, who had been 
the prime mover behind a major nanotechnology
initiative during the Clinton administration. Kalil
had urged Chun to bring him “out-of-the-box,
great and progressive ideas”, she recalls.

In December 2011, Chun brought the 
Chicheley Hall group to Washington DC to 
present its vision to the OSTP. Led by Rafael 
Yuste, a biologist at Columbia University in 
New York and co-director of the Kavli Institute 
for Brain Science, the group at first had mod- 
est plans. They would deploy the new tools in 
simple model organisms — worms, flies and 
leeches — before graduating to zebrafish, mice 
and rats. 

When the scientists returned to Wash- 
ington in May 2012, OSTP officials “wanted 
us to think bigger”, Yuste says. They pushed
the group to expand the project to human 
brains — and to the study of possible inter- 
ventions for brain disorders. 

That goal has some observers cautioning 
against the selling of a basic-science project as 
a way to find cures for Alzheimer’s disease and 
Parkinson’s disease. “Don’t overpromise,” says
Arthur Caplan, a medical ethicist at the New 
York University Langone Medical Center. “The 
genome mapping I think was worth doing. But 
it has left many in Congress and some on the 
industry side saying: ‘where are the goodies?’”

With the genome project, the goal was relatively
fixed: a sequence of 3 billion base pairs.
But Donald Stein, a neuroscientist at Emory
University in Atlanta, Georgia, points out that 
brain anatomy and chemistry are constantly 
shifting. Mapping structure and measur- 
ing electrical activity at a particular moment 
doesn’t necessarily mean that answers will 
“miraculously appear”, he says.

Critics also wonder who will pay for the 
project at a time when budgets are stretched 
thin. The NIH has US$545 million in its Com- 
mon Fund, a flexible source of money for new 
projects — but last year, all but $30 million of 
it was tied up in existing projects. The BAM 
would probably cost several billion dollars over 
the course of a decade or more. “What moti- 
vates people to pursue these big projects is not 
the belief that they will solve problems,” says 
Michael Eisen, a biologist at the University of 
California, Berkeley. “It’s the belief that this is 
the way to get money.” 

However, two other agencies are expected to 
help support the project: the National Science 
Foundation and the Department of Defense, 
which has an interest in treatments for trau- 
matic brain injuries and in mind-controlled 
prosthetics. And private foundations, such 
as the Howard Hughes Medical Institute in 
Chevy Chase, Maryland, have already spent 
hundreds of millions of dollars on brain- 
mapping research. Yuste wants to leverage 
that work, and make BAM a private–public
partnership rather than a competition
between the government and private funders, 
as the genome project was. “We are trying to 
learn from the Human Genome Project, the 
mistakes they made,” says Yuste.

Whatever becomes of the BAM, one thing is 
clear: it has a powerful advocate in the Obama 
administration. Ron Kalil recalls once asking 
his son what he actually did in the OSTP. His 
answer was simple: “I’m the make-it-happen guy.”


CORRECTION 

The News story ‘Tusk tracking will tackle 
illegal trade’ (Nature 494, 411-412; 
2013) gave the wrong dates for the ivory 
seizures in the map — they were all 2006. 
It also omitted a reference to the statistical 
models: Wasser, S. K. et al. Proc. Natl Acad. 
Sci. USA 104, 4228-4233 (2007). 
Science remains institutionally sexist. Despite
some progress, women scientists are still paid
less, promoted less, win fewer grants and are more
likely to leave research than similarly qualified men.
The reasons range from overt and covert discrimination
to the unavoidable coincidence of the productive
and reproductive years.

In this special issue, Nature takes a hard look at the
gender gap and at what is being done to close it. A survey
of the data (see page 22 and go.nature.com/ytmyhf)
reveals where progress has been made and where inequalities
still lie, from salary to tenure. A News Feature
(see page 25) reveals a particular dearth of women in
some commercial spheres, such as on the scientific
advisory boards of biotechnology firms, and an article
by historian Patricia Fara (see page 43) traces the
wearying stereotypes perpetuated by the biographers
of women scientists.

A series of Comment articles looks at possible solutions.
Neuroscientist Jennifer Raymond (see page 33)
calls on both sexes to recognize and reduce their biases
against women in science, and eight researchers from
around the world offer their prescriptions (see page 35),
from equalizing the retirement age in China, to liberalizing
travel restrictions in Saudi Arabia, to boycotting
conferences that lack female speakers. We catalogue
some of the ambitious moves being made in Europe
to get more women into top positions (see page 40)
and explore some surprising statistics about mandatory
quotas (see page 39). Finally, profiles of four successful
30-something women (see page 28) show how ambition
and talent can trump obstacles.

This special issue is dedicated to the memory of
Maxine Clarke. In the 28 years that Maxine spent championing
the highest scientific standards as an editor at
Nature, she was all too often the only one to ask, “Where
are the women?”

WOMEN IN SCIENCE 


The gender gap and how to close it 
nature.com/women 


IMAGE: VIKTOR KOEN 


7 MARCH 2013 | VOL 495 | NATURE | 27


NEWS FEATURE


MIND THE GENDER GAP


Despite improvements, female scientists continue to face 
discrimination, unequal pay and funding disparities. 


As an aspiring engineer in the early
1970s, Lynne Kiorpes was easy to spot 
in her undergraduate classes. Among 
a sea of men, she and a handful of 
other women made easy targets for a 
particular professor at Northeastern 
University in Boston, Massachusetts. On the 
first day of class, “he looked around and said 
‘I see women in the classroom. I don't believe 
women have any business in engineering, and 
I'm going to personally see to it that you all fail.’”

He wasn't bluffing. All but one of the women 
in the class ultimately left engineering; Kiorpes 
went on to major in psychology. 

Such blatant sexism is almost unthinkable 
today, says Kiorpes, now a neuroscientist at 
New York University. But Kiorpes, who runs 
several mentoring programmes for female stu- 
dents and postdoctoral fellows, says that subtle 
bias persists at most universities. And it drives 
some women out of science careers. 

By almost any metric, women have made 
great gains in closing the scientific gender 
gap, but female scientists around the world 
continue to face major challenges. Accord- 
ing to the US National Science Foundation, 
women earn about half the doctorates in 
science and engineering in the United States 
but comprise only 21% of full science profes- 
sors and 5% of full engineering professors. And 
on average, they earn just 82% of what male 
scientists make in the United States — even 
less in Europe. 

Scientific leaders say that they continue to 
struggle with ways to level the playing field 
and entice more women to enter and stay in 
science. “We are not drawing from our entire 
intellectual capital,” says Hannah Valantine, 
dean of leadership and diversity at the Stan-
ford School of Medicine in California. “We’ve 
got to put on the accelerator to evoke social 
change.” 

One of the most persistent problems is that a 
disproportionate fraction of qualified women 
drop out of science careers in the very early 
stages (see ‘Women in science’). A 2006 survey
of chemistry doctoral students by the Royal
Society of Chemistry in London, for example, 
found that more than 70% of first-year female 
students said that they planned a career in 
research; by their third year, only 37% had that 
goal, compared with 59% of males’. 

Many experts say that a big factor driving 
this trend is the lack of role models in the 
upper divisions of academia, which have been 
slow to change. The Royal Society of Chemis- 
try has found, for instance, that female chem- 
istry students are more likely than males to 
express low self-confidence and to report dis- 
satisfaction with mentorship’. Female students 
“conclude consciously and unconsciously that 
these careers are not for them because they 
don't see people like them”, suggests Valantine.
“That effect is very, very powerful — this sense 
of not belonging.” 

The attrition continues at later stages. In 
biology, for example, women comprised 36% 
of assistant professors and only 27% of tenure 
candidates in a 2010 study by the US National 
Research Council’. “We're not talking about 
a lack of talent here. Part of the story is that 
women leave earlier. In a sense, they give up
on an academic career,” says Curt Rice, vice-
president of research and development at the
University of Tromsø in Norway, who has
studied gender equality in US and European 
universities. 


FAMILY VALUES 

Many of the UK chemistry students viewed 
research as an all-consuming endeavour that 
was incompatible with raising a family. Meet- 
ing the demanding schedule of academic 
research can seem daunting for both mothers 
and fathers. But family choices seem to weigh 
more heavily on the career goals of women. 

Law professor Mary Ann Mason at the 
University of California, Berkeley, and her 
colleagues have found’ that male and female 
postdocs without children are equally likely 
to decide against research careers, each leav- 
ing at a rate of about 20%. But female postdocs 
who become parents or plan to have children 
abandon research careers up to twice as often 
as men in similar circumstances. 

“The plan to have children in the future, or 
already having them, is responsible for an enor- 
mous drop-off in the women who apply for 
tenure-track jobs,” says Wendy Williams, a psy- 
chologist at Cornell University in Ithaca, New 
York. Furthermore, women who do become 
faculty members in astronomy, physics and 
biology tend to have fewer children than their 
male colleagues — 1.2 versus 1.5, on average — 
and also have fewer children than they desire’. 

In response to these concerns, many universities
have taken steps to establish family-friendly
policies such as providing child-care
assistance and extending tenure clocks for new
parents. Shirley Tilghman, president of Princeton
University in New Jersey, believes that such
initiatives provide crucial support for women,
but that other solutions are still needed. “I
don't think there's a single obstacle,” she says.
“I think there’s a whole series of phenomena
that add up.”


WOMEN IN SCIENCE: MANY HURDLES AHEAD

The number of women studying and practising science has risen sharply, but women are
disproportionately driven away from scientific careers.

GRADUATE SCHOOL

The fraction of women gaining doctorates in science has more than doubled in the United States since 1980 and is now nearing equity. In some
European countries, women outnumber men in science degrees but there is significant variation between nations and fields.

US FEMALE DOCTORAL RECIPIENTS IN SCIENCE AND ENGINEERING
FEMALE DOCTORAL RECIPIENTS IN SCIENCE IN EUROPE (2006): Lithuania, Italy, Spain, Belgium, France, Germany, The Netherlands

POSTGRADUATE POSITIONS

A 2009 survey of postdoctoral fellows at the University of California showed that women who had children or planned to have them
were more likely to consider leaving research.

POSTDOCS WHO DECIDED AGAINST CAREERS AS RESEARCH FACULTY MEMBERS (2009)

Female representation among science and engineering faculty members in the United States has lagged behind gains in graduate education, in
part because many women do not apply for tenure-track jobs. But women who do apply are more likely than men to receive interviews and offers.

RISING IN THE RANKS

A study of US science departments showed that women were more successful than men in gaining tenure between 2002 and 2004.
In Europe as in the United States, the gender gap is greater among senior than among junior faculty members.

THE FUNDING GAP

Women are earning an increasing share of research grants from the US National Institutes of
Health (NIH) but the average size of their awards has consistently lagged behind what men receive.

NUMBER OF NIH RESEARCH GRANTS, PROPORTION GOING TO WOMEN AND AVERAGE SIZE OF GRANT (2002 AND 2012)

THE SALARY GAP

Female scientists in the United States earn much less than men, on average, with the difference varying strongly by field.

AVERAGE PAY GAP, ALL POSITIONS (2008 MEDIAN SALARIES)


LIVE ISSUE 

At Yale University in New Haven, Connecticut, 
microbiologist Jo Handelsman is one of many 
researchers who think that gender discrimina- 
tion continues to be a significant part of the 
problem. In a much-talked-about experiment 
last year®, her team showed that science faculty 
members of both sexes exhibit unconscious 
biases against women. Handelsman’s group 
asked 127 professors of biology, chemistry 
and physics at 6 US universities to evaluate the 
CVs of two fictitious college students for a job 
as a laboratory manager. The professors said 
they would offer the student named Jennifer 
US$3,730 less per year than the one named 
John, even though the CVs were identical. The 
scientists also reported a greater willingness to 
mentor John than Jennifer. “If you extrapolate 
that to all the interactions that faculty have 
with students, it becomes very frightening,” 
says Handelsman. 

Her findings match well with the results of 
a survey’ done in 2010 by the American Association
for the Advancement of Science. Of
the 1,300 or so people who responded, 52% of 
women said that they had encountered gender 
bias during their careers, compared with just 
2% of men. 

Still, other concrete evidence of bias is hard 
to find. Some measures show female scientists 
outperforming male rivals in landing inter- 
views and job offers early in their careers. The 
National Research Council study’ showed that 
women accounted for 19% of the interview 
pool and received 32% of job offers for tenure- 
track electrical-engineering positions. Women 
fared just as well as men in tenure evaluations, 
but female assistant professors in many disci- 
plines seemed less likely to reach tenure con- 
sideration compared with men. 

Women face even more daunting odds in 
Spain. Men are 2.5 times more likely to rise 
to the rank of full professor than female col- 
leagues with comparable age, experience and 
publication records’. 

Disparities can also be found in grant fund- 
ing in some countries. In one frequently cited 
study’, Christine Wenneras and Agnes Wold at 
the University of Gothenburg in Sweden found 
in 1997 that female applicants for postdoctoral 
fellowships had to score 2.5 times higher on an 
index of publication impact to be judged the 
same as men. 

Several groups, such as the UK Medical 
Research Council and biomedical research 
charity the Wellcome Trust, have since inves- 
tigated their grant programmes and found 
negligible or very subtle effects of gender”. The 
Canadian Medical Research Council found no 
differences in success rate in most of its research 
grant programmes, but reported lower success
rates for women in some training grants". In 
the United States, women are slightly more suc- 
cessful than men in obtaining grants from the 
National Science Foundation, but the trend is 
reversed for the National Institutes of Health 
(NIH). The NIH also gives women smaller 
awards on average (see ‘The funding gap’).

Information provided to Nature by the NIH 
through a Freedom of Information Act request 
indicates that the percentage of women on 
review panels has improved marginally over 
the past decade, from 25% in 2003 to 30% in 
2012. Those figures roughly parallel the per- 
centage of women applying for and receiving 
grants in that time. 


PAY PROBLEMS 

The inequalities also extend to salaries. In the 
European Union, female scientists earned on 
average between 25% and 40% less than male 
scientists in the public sector in 2006 (ref. 12). 
Although the average pay gap is smaller in the 
United States, the disparity is particularly large 
in physics and astronomy, where women earn 
40% less than men. 

For young academic scientists, however, 
those differences may be fading. The National 
Research Council found an 8% pay gap at the 
level of full science and engineering professors 
but no significant differences among junior fac- 
ulty members’. Some experts argue, however, 
that the salary gap may reflect other continued 
trends, such as the fact that a disproportionate 
share of women move into non-tenure posi- 
tions or faculty jobs at lower-status universities. 

Tilghman says that Princeton and many 
other universities have grown increasingly 
conscious of the need to track and rectify gen- 
der gaps in salary and other institutional sup- 
port. “Absolutely, it needs eternal vigilance,” 
she says. “But we're in a much better place.”


Helen Shen is an intern with Nature in 
Washington DC. 
BARRED FROM THE BOARDROOM


The number of women in scientific research is going up —
but where academia crosses into industry,


BY ALISON MCCOOK


Nancy Hopkins started Googling her
colleagues in spring 2012. She mentally
scanned the hallways of her institution
at the Massachusetts Institute of
Technology (MIT) in Cambridge — along
with the campuses of other elite institutions
— for the offices of men she knew who had
founded companies. Then she clicked on
the websites of their firms, and counted
the number of men and women serving on
their scientific advisory boards (SABs), a
prestigious position for researchers who steer
the company’s scientific direction.

It was an informal exercise, rather than a
systematic survey. But Hopkins, a molecular
biologist at MIT and a long-time campaigner
for women in science, found the results
shocking. A sample of 12 of the companies
she examined had a total of 129 SAB members;
only 6 were women. “I was completely
stunned,” says Hopkins. “And it made me sad. I thought, ‘gee, why
don’t these men want to work with [MIT] women?’ We have such
incredible women faculty.”

The proportion of women in industrial and academic science 
has shot up over the past 20 years. According to the US National 
Science Foundation, women make up 25% of tenured academics 
in science and engineering and more than 25% of industry scien- 
tists in research and development. But when it comes to academ- 
ics engaging in commercial work — patenting their discoveries, 
starting biotech companies or serving on SABs — the picture is less 
progressive. Studies have confirmed Hopkins’ impression that even 
leading female scientists are often absent from these roles. “The 
secret club [of men] used to be 
going to the lab and conferences,”
says Fiona Murray, who studies
life-sciences entrepreneurship at
MIT. “That world has changed
a lot, but we have a new venue
where it is still difficult for women
to play a similar role.”

Experts in industry and academia
speculate that the disparity
could reflect the small numbers
of women in certain specialized
fields; the demands of family life;
or a residual male clubbiness.
Whatever the reasons, this stubborn
gender gap hurts everyone,
says Bonnie Bassler, a molecular
biologist at Princeton University
in New Jersey. “I think the companies
would do better science by
having the best people on their
board. And I think these women, who are great scientists, would
do better science in their labs by having access to these ideas.”
“Everybody’s losing,” says Bassler.
HIDDEN PROBLEM

For much of the 1980s and 1990s, there were more than 11 men 
for every one woman in the science faculty at MIT. Things started 
to change 20 years ago, when Hopkins, as the first chair of MIT’s 
Committee on Women Faculty in the School of Science, and her 
team drove through major increases in the hiring of women. By 
2006, one out of every five biology faculty members on the MIT 
campus was a woman. 

At a dinner last April to honour these achievements and mark
her retirement from the lab, Hopkins spoke about the work still to 
be done. She talked about a list she had been given by a graduate 
of Harvard Business School in Boston, Massachusetts, showing 
the names of scientists in the area who had received funding from 
a local venture-capitalist firm. Among 100 names, only one was 
a woman. The list would not have surprised Hopkins more than 
30 years ago, when she had been told by a colleague that “women 
aren't allowed” to found biotech companies. But to see such a dearth
of academic women in modern biotechnology was upsetting. 

Around that time, Hopkins embarked on her Google search. 
She was particularly interested in SABs because they consist 
mainly of working scientists who are often invited by the com- 
pany’s academic founders — a social process that could reveal 
conscious or unconscious biases against female academics. And 
membership in advisory boards comes with advantages: it can tip 
members off to promising tools and areas of research, and lead to 
other lucrative prospects, such as consulting. Plus, for a few meet- 
ings per year, board members are paid a sometimes-substantial 
fee, given stock options, or both. 
INEQUALITY ON BOARD

The proportion of women on biotech scientific advisory boards
(SABs) lags behind the proportion of eligible female candidates.

Chart series: proportion of women on SABs; proportion of female PhD holders
in matched sample of scientists. Horizontal axis: year PhD earned.


The first name Hopkins looked up was Eric Lander, found- 
ing director of the Broad Institute of MIT and Harvard. She 
typed “Eric Lander companies” into the search engine. Scrolling 
through the results, she came upon Verastem, a cancer stem-cell 
company founded in 2010 by Lander and others, including Robert 
Weinberg, a cancer researcher at the Whitehead Institute in Cam- 
bridge. She counted 14 people on Verastem’s SAB; all were men. 

Entering “Phil Sharp companies” brought up Alnylam Pharma- 
ceuticals, a Cambridge-based firm co-founded by the Nobel prize- 
winning molecular biologist at MIT in 2002. The company, which is 
developing therapies based on RNA interference, had one woman 
on its 11-person SAB. “Bob Langer companies” yielded a handful
of the 20-plus firms that the MIT bioengineer has helped to launch, 
including Taris Biomedical in 
Lexington, Massachusetts, which 
focuses on genitourinary condi- 
tions, and the biopharmaceutical 
company Blend Therapeutics in 
Watertown, Massachusetts. Nei- 
ther SAB included any women. 
(Weinberg and Lander say that 
they were not involved in select- 
ing the SABs at Verastem, and 
Langer that he was not involved 
with the process at Blend or Taris. 
Sharp says that at Alnylam, choos- 
ing the SAB required “agreement 
between” the founders, chief 
executive, venture capitalists and 
other people already brought into 
the company.) 

Hopkins included in her search 
a few scientists from other insti- 
tutions, such as Harvard Univer- 
sity in Cambridge and Memorial Sloan-Kettering Cancer Center 
in New York. Overall, among the full-time professors affiliated 
with a sample of 14 companies she reviewed, only 5% of found- 
ers or SAB members were women. Although boards change over 
time, that fraction was much the same as of last month. 

Last July, Hopkins began circulating her results to a handful 
of faculty members at MIT and to scientists further afield. Vicki 
Sato, a professor of biology and management at Harvard with 
a long career in the biotechnology industry, says she could not
believe what she was seeing. “I was stunned by the sampling she 
had done, and told her she had to be wrong,” says Sato. “But I 
knew deep down she was right.” 
GLOBAL CONCERN

More rigorous studies have reached similar conclusions. In a paper 
published last October’, Murray, Toby Stuart at the University of 
California, Berkeley, and Waverly Ding at the University of Mary- 
land in College Park reviewed all publicly available lists of US 
biotech SABs, starting in the 1970s and including about 500 com- 
panies. Although women represented between 12% and 30% of 
academically active PhD holders over that time period, the per- 
centage of women on SABs never exceeded 10.2% (see ‘Inequality 
on board’). Even when the researchers compared male and female 
faculty members with similar levels of achievement, measured by 
factors such as publication and citation counts, male scientists were 
roughly twice as likely to join SABs as female ones. 

SABs are not the only commercial forum in which academic 
women seem to be disadvantaged. US women also receive pat- 
ents about 40% as often as men’, start businesses half as often! 
and receive significantly less funding for the start-ups that they 
do launch’. This is not just a US problem: a study released in 
April 2012 by the Royal Society of Edinburgh found that women 
are underrepresented on the boards of UK science, technology,
engineering and mathematics companies’. That is despite the fact 
that including women seems to be beneficial: a 2012 report from 
Credit Suisse in Zurich, Switzerland, found that worldwide, com- 
panies with women on the board have higher share prices than 
those with all-male boards’. 


INVITATION ONLY 

So what is going on? For SABs, Hopkins thinks that the answer is 
simple: women are not asked. When she noticed the stark patterns 
in board memberships, Hopkins asked some of her female col- 
leagues — including one she believed was an “absolute star” — if 
they had ever been invited to serve on boards. All of them said 
no. “In the end, these stories are very sad,” says Hopkins. “People
know they’re excluded, and it’s costly profes- 
sionally. They're embarrassed to talk about it. 
It’s like not being asked to dance.” 

But the picture is not so simple, says Paul 
Schimmel, a former colleague of Hopkins who 
is now based at the Scripps Research Institute 
in La Jolla, California, and is a co-founder of 
Alnylam. He says that he has tried to ensure 
equal gender participation in his lab and his 
companies for the past 20 years. “There's no 
lack of effort, I tell you,” says Schimmel. But 
serving on a board “can be a lot of work” — 
conference calls, e-mails, travel several times 
a year and thick documents to review — and 
women often bear the majority of domestic 
work and child care. At least one woman has 
turned down Schimmel’s invitation to serve on
an SAB because of family responsibilities, he 
says. Indeed, research has shown that female academics with chil- 
dren are less likely than those without to patent their discoveries®. 

Some prominent female scientists disagree. Carolyn Bertozzi, 
a chemical biologist at the University of California, Berkeley, who
has two young children and one on the way, says that she is always 
willing to make time to serve on the research advisory board at 
GlaxoSmithKline, which entails attending two-day meetings 
twice a year for “generous” compensation. The meetings teach her 
about what it takes to make a drug, including medicinal chemis- 
try, regulatory issues and intellectual property; that helps with her 
start-up, Redwood Bioscience in Emeryville, California, which 
has two female SAB members out of four. Bertozzi acknowl- 
edges that her situation is unusual: her female partner is a stay- 
at-home mother. But Bassler, too, says that the work involved in 
SABs is worth the sacrifices. “If I were asked to serve on a board, 
I wouldn't do something else,” she says. Bassler has been invited 
to serve on two SABs in her career, but “of course” would accept 
another invitation if it arose.

Research seems to support the idea that it is a lack of invita- 
tions — not a lack of time — that reduces female membership in 
biotech SABs. Murray, Stuart and Ding found that both men and 
women tend to join SABs on average around the 20th year after 
completing their PhDs’ — often a time when the major strain of 
child rearing is over. This suggests that family obligations are not 
holding back women more than men. And in interviews at a lead- 
ing institution that Murray declined to name, women consistently 
reported they had rarely been invited to serve on their colleagues’ 
SABs — which was not the case in a matched sample of men’. 

Stuart says that the disparity is most likely to be a result of social
connections and unconscious bias among men. “If you're male, 
you're slightly more comfortable shooting the shit with your male 
colleagues, and they're who come first to mind when you're putting
these boards together. You may assume — ‘oh, she’s got two
kids, she's not going to be interested’ — and then not invite her.”


“People know they're excluded. They're embarrassed to
talk about it. It’s like not being asked to dance.”


But companies say that they can have difficulty finding women 
with the right experience, because there are fewer women than 
men in academia overall. At Alnylam, says Schimmel, the type of 
science and the diseases it hopes to treat “considerably narrow the 
size of the pool of highly qualified senior investigators, regard- 
less of gender”. (A statement from the company notes that women 
represent “nearly 30%” of Alnylam’s management team.) At Taris, 
says Langer, the SAB had to include mostly clinical experts in urol- 
ogy, who are generally men. And Verastem found that there were 
few prominent female biologists who focus on cancer stem cells, 
says chief medical officer Joanna Horobin. At least one woman 
declined the offer to join the SAB, Horobin says, because she was 
already working with a competing company. 

The academics and biotech companies interviewed for this 
story say that they hope the situation will 
change. At Alnylam, people have “discussed
openly the issue of gender and the SAB”, says
Schimmel. “All of us support strongly the
idea of addressing the ‘gender problem’ in a
thoughtful way and are actively working on
it.” In Lander’s opinion, more important than
the make-up of the SAB is the selection of the 
company’s board of directors — who “control 
the entire company”. Two out of seven direc- 
tors at Verastem are women. 

Women can also make the first move, says 
Helen Blau, a stem-cell biologist at Stanford 
University in California, who has served on the 
advisory boards for several start-ups. She broke 
into commercialization by patenting discover- 
ies and talking to companies at conferences 
about her work. The effort paid off: companies 
have licensed at least a dozen of her patents, which helped Blau to 
get consulting jobs, board invitations and now her own start-up, 
Didimi in Berkeley, California. 

Hopkins, meanwhile, has not let the issue lie. After she discussed 
her data with MIT colleagues, the group decided to forward the 
findings to the university's provost, Chris Kaiser. It turned out that 
Lydia Snover, director of institutional research at MIT, had already 
started mining faculty CVs across the entire institution for infor- 
mation about activities such as patenting, technology licensing 
and participation in SABs. If MIT finds gender differences and can 
help to do something about them, it will, says Snover. “We want all 
[faculty members] to be involved in the same way.”

Hopkins wants to see all institutions follow MIT’s example. In 
academia, people used to believe that “time would fix things
naturally”, and that women would eventually move up the ranks, she
says — and this attitude may still exist when it comes to academics
moving into industry. “I think [the gender disparity in SABs] is
what universities would look like if we hadn't stopped, analysed
what was going on, and changed it. If you don’t put attention to
it, it doesn't happen.” ■


Alison McCook is a freelance writer and editor in Philadelphia, 
Pennsylvania. 
KAY TYE: POWER MOVER
The neuroscientist break-dancing down the 
tenure track. 


BY HEIDI LEDFORD 


Being five months pregnant comes with a series of concessions: no
booze, no sushi, no double-shot espressos. Less appreciated, perhaps,
is the havoc it can wreak on a break-dancer’s moves. “My dancing is
definitely limited now,” says Kay Tye, neurobiologist, award-winning
b-girl and assistant professor at the Picower Institute for Learning
and Memory at the Massachusetts Institute of Technology (MIT) in
Cambridge. “I can't do windmills — I can’t do anything that might
cause me to fall. Which is, like, everything.”

It is one of the few limitations that Tye, 31, has been willing to accept. 
Striving to make her mark in optogenetics, one of the hottest fields in 
neuroscience, Tye thought nothing of working past midnight, getting 
by on four or five hours sleep a night and maintaining a constant, trans- 
continental travel schedule. She has had to dial back a little in recent 
weeks, and she knows that life may change further once her daughter is 
born. But she is ready. “I’ve been preparing for this my entire life,” she 
says. “I chose a career path that’s family friendly.”

An assistant professorship at MIT, where the tenure rate hovers at 
around 50% and the faculty is still about 80% male, may not strike many 
as particularly family friendly. But Tye, the daughter of a theoretical- 
physicist father and a biochemist mother, grew up in her mother’s 
lab, where she was paid 25 cents per box to rack pipette tips. With her 
mother as a role model, Tye says that she was in her teens before it 
occurred to her that her gender could hold back her career. “And by 
then, my brain was already fully formed,” she says with a smile. 

Even so, Tye wasn't sure that science was for her. After graduating
from MIT, where she first took up break-dancing, she travelled to 
Australia to live on a cattle farm, in a yoga ashram and finally in a beach
tent in an art commune. Her goal was to live moment-to-moment and 
write a novel based on her experiences. But Tye found her new lifestyle 
unfulfilling — and, she adds, the novel wasn't very good. She flew back 
home and started graduate school at the University of California, San 
Francisco. After rotating through the usual three labs without finding a 
suitable home, she begged neurobiologist Patricia Janak for the chance 
to do one last placement in her lab. “If you don't let me rotate, I'm going
to drop out,” Tye tearfully told her. 

Tye got the place and a new mentor: Janak, a successful female scien- 
tist with two children. And Janak watched Tye bloom. “Her insecurities 
rapidly disappeared,” she says. “She started to get amazing results.” In 
Janak’s lab, Tye published her first Nature paper after finding that in 
rats learning to associate a cue with a reward, there was a boost in the 
activity and synaptic strength of neurons in the amygdala, a brain region 
that in humans is associated with processing emotions (K. M. Tye et al. 
Nature 453, 1253-1257; 2008). But Tye wasn't satisfied: she wanted to be
able to switch neurons on or off directly. That led her to optogenetics, a 
way to activate or inhibit specific neurons in rodent brains using light. 

After a two-year postdoc learning the technique in Karl Deisseroth’s lab 
at Stanford University in California, Tye landed at the Picower. She plans 
to use the approach to map the neural circuits that govern whether an ani- 
mal forms a positive or a negative association with a given environmental 
cue. Ultimately, she hopes that her studies can be used to devise ways to 
treat disorders such as anxiety, depression and addiction. 

Over the past five years, the Picower has recruited a number of young 
female faculty members, several of whom have since started families. 
(MIT opened a day-care facility across the street from Tye’s office in 
2004 and uses it as a recruitment tool.) “It definitely helps to know they
have paved the way,” says Tye.

Since her return to MIT a year ago, Tye has recruited four graduate stu- 
dents and four postdocs, applied for 13 grants, extended her list of top-tier 
papers and begun to prepare herself for the impact of motherhood. Some 
decisions are easy: the exercise bike in her office will be replaced with “a 
crib, or playpen, or whatever” for the times that her daughter accompanies 
her to work. Some are more difficult, like a trip to Tokyo to speak at a 
conference a month after the baby is due. Tye can’t say no, not yet. 

And tenure remains near the top of her list. “I never thought that 
my life had to be limited to anything,” she says. “And I want to set that
example for my daughter.” ■
KEITY SOUZA SANTOS: VENOM DETECTIVE
An immunologist who studies allergic shock
receives a shock of her own. 

BY ANNA PETHERICK 


What should have been an ordinary Thursday for Keity
Souza Santos turned out to be anything but. It was 4 a.m. 
when she woke up on 22 November 2012, tired but alert. 
She had been meaning to take a pregnancy test for days; 
now she decided she couldn't put it off any longer, and 
headed to the bathroom. Later, at work at the University of São Paulo
Medical School’s allergy and immunology department in Brazil, San- 
tos, 33, told none of her colleagues why she had felt like screaming for 
joy hours earlier. She kept her secret even when one of them called to 
tell Santos that she had won the prestigious Young Investigator Award 
from the São Paulo Research Foundation. That meant that she would
be starting her own lab at about the same time as her baby was due. 
Only it will not be just one baby; Santos is expecting twins. 
Santos studies life-threatening allergens in foods and insects, a serious 
threat in Brazil. Well known for its stunning biodiversity, the country 
has more than 400 species of wasp compared with the Northern
Hemisphere’s 30-odd. One species, Polybia paulista, causes hundreds 
of hospitalizations in Brazil every year. But doctors often have trouble 
pinpointing the cause of the allergic reactions. “Sometimes patients even 
bring the wasp to the hospital, but even then we cannot treat them properly
because we don't know what allergens are in the sting,” Santos says.

During a PhD at the University of São Paulo, Santos worked on an
antivenom against the sting of the Africanized honeybee or ‘killer bee’ 
(Apis mellifera L.). As a postdoc, she studied the proteins responsible 
for anaphylactic reactions to cassava (Manihot esculenta), a staple food 
in north Brazil, and to the sting of several wasps. From P. paulista alone,
she and her colleagues separated out and identified 84 venom pro- 
teins — including some that had previously been found only in snake 
venom — and showed how they can trigger devastating tissue damage 
(L. D. dos Santos et al. J. Proteome Res. 9, 3867-3877; 2010). Now she 
is trying to identify the offending proteins in other insect venoms. 

To learn the mass spectrometry and other molecular techniques
required for the task, Santos spent months in labs in the United States
and Austria. While abroad, she heard tales of sexism, something she
says she did not encounter when growing up in Brazil. Santos says that
her family was shocked when she announced (after reading about the
cloning of Dolly the sheep) that she wanted to become a biologist, but
not because of her gender. No one in her family was a scientist, and
such a career was different from the world they knew.

Now working largely independently, Santos's goal is to create kits
that will help doctors to quickly identify the allergens to which a
person has been exposed and how to detoxify them. But first she is
focused on the challenges that this summer will bring. “My boss is a
little bit worried,” she says. “But I already have a PhD student and a
technician. We can Skype a lot [during my maternity leave].”

“I think she will manage,” says immunologist Jorge Kalil, Santos's head
of department, before adding after a pause, “but they are twins”. Santos
has no such qualms. “I want to increase my group of students and
collaborators,” she says. “Why would I give up my scientific career now?” ■


MONICA BETTENCOURT-DIAS: CELL MECHANIC
A biologist who explores and shares the
intricacies of the cell.

BY ALISON ABBOTT


Monica Bettencourt-Dias grew up surrounded by role models. Despite
being relatively poor, Portugal has an excellent record within Europe
for appointing women to top positions in academia and other
professions. Some think that the situation traces back to the 1960s
and 1970s, when educated young men were sent to fight in Angola and
Mozambique, leaving room to promote women and spawning a gender
blindness in academia. On top of that, Bettencourt-Dias was raised by a
supportive mathematician father and social-scientist mother, and she
came of scientific age just as her country was introducing an
innovative, government-paid doctoral programme, for which she was
selected in 1996. “We had some of the world’s best scientists teaching
us,” she recalls, “and I was able to learn that my destiny was cell
biology and development.”

Bettencourt-Dias travelled to University College London to study
the regenerative properties of salamanders as part of her PhD. Later, as
a postdoc at the University of Cambridge, UK, she discovered a master
regulator of the centrosome, an organelle that organizes some of the
key structures and machinery involved in the cell cycle, bagging her
first Nature paper (M. Bettencourt-Dias et al. Nature 432, 980-987;
2004). She returned to Portugal in 2006 to start her own laboratory at
the Gulbenkian Institute of Science in Oeiras and the money has flowed
generously ever since — including a prestigious €1.5-million
(US$2-million) Starting Grant from the European Research Council.

Now 39, Bettencourt-Dias’s life changed a few months ago with the
long-awaited arrival of her adopted one-year-old daughter. But Portugal
has abundant professional child-care places, and family members tend
to be close by to help out. Bettencourt-Dias’s husband, also a scientist,
does his share of the child-raising, and the couple has hired help for
their domestic chores.

In the lab, Bettencourt-Dias still focuses on the tight communication
and organization imposed by cellular-signalling pathways and
centrosomes. There are parallels in her own life. Being well-organized
has been essential to her career, she says, and she developed a drive for
communicating science to the public that has led to regular
participation in workshops in Portugal's former colonies. Last year, she
organized a molecular biology workshop in the west African island
nation of Cape Verde, which in 2008 became the first country in Africa
to have a government comprising a majority of women. The workshop had
a similar number of men and women, Bettencourt-Dias says, and the
attendees “told me they wanted to learn science to help their country —
you don't hear this in Western countries”.

Sharing is a life philosophy for Bettencourt-Dias; her discoveries
are recalled in those terms. One of her first such moments came
in Cambridge when she and her first PhD student showed that an
enzyme called PLK4 is important for the structure of fly centrosomes
(M. Bettencourt-Dias et al. Curr. Biol. 15, 2199-2207; 2005). Together,
they watched scores of tiny centrosomes forming under the microscope.
“It is beautiful to share that moment with someone you are
teaching,” she says. ■


30 | NATURE | VOL 495 | 7 MARCH 2013


© 2013 Macmillan Publishers Limited. All rights reserved 


ROBERTO KELLER-PEREZ 


TONY MARKS 


AMANDA WELTMAN: DRIVING FORCE 


A cosmologist who probes dark energy and
ignores stereotypes. 


BY LINDA NORDLING 
When Amanda Weltman discovered physics as an undergraduate
at the University of Cape Town in South Africa, she
thought that “understanding the way the Universe worked
was just about the coolest job anyone could have”.

Weltman was just 24 when she shot to fame with a pro- 
posal about how the Universe works at the grandest scales. Her break- 
through paper, ‘Chameleon Cosmology’ (J. Khoury and A. Weltman Phys. 
Rev. D 69, 044026; 2004), published when she was a graduate student at
Columbia University in New York, gave rise to a popular theory to explain 
the phenomenon of dark energy — the mysterious force that is hypoth- 
esized to be speeding up the expansion of the Universe. 

Weltman and her colleague Justin Khoury suggested that a new force 
that changes according to its environment could explain many observa- 
tions about the Universe's expansion. This ‘chameleon’ force would be 
weak when particles are packed together, such as on Earth or in the early 
Universe. But as galaxies fly apart the force would grow, and accelerate 
the growth. What makes their theory popular is its testability: it predicts 
that a photon will sometimes decay into a chameleon ‘particle’ when
travelling through a strong magnetic field. Experimental physicists have 
begun looking for this effect, but haven't yet found anything conclusive 
(see Nature http://doi.org/b96z3f; 2009). 

In 2009, after finishing her PhD at Columbia and a postdoc at the 
University of Cambridge, UK, Weltman moved back to South Africa. 
This enabled her to start a life with her husband, string theorist Jeff 
Murugan, whom she had met a decade earlier. Until that point, their 
courtship had been a typical case of academia’s ‘two-body problem’ — 
mostly conducted at great distances. Their return to South Africa was 
also driven by idealism. After years learning from the best in their disci- 
plines, they wanted to bring that expertise home. “We thought we could 
be better put to use here to grow the country’s science and knowledge,”
Weltman says. 

Back at the University of Cape Town, she is part of a large research 
group, but is also building her own — she has one student and one post- 
doc so far — to extend and test the chameleon theory. Last year she 
received a ‘P’ rating from the country’s National Research Foundation, 
a distinction given to a handful of young researchers who are on their 
way to becoming international leaders in their field. 

Weltman thinks that early barriers for women — the expectations
that girls are better at soft sciences than hard ones, or that mathematics
[…] such stereotypes did not exist, she says, and is grateful to
have had role models, many of whom were men. “I don’t think girls
necessarily need girl role models, but I think they need good role
models,” she says.

Having her husband down the hall was handy after the arrival of their
two children, now 2 years old and 8 months old. Weltman, who is now 33,
kept her research going through her four-month bouts of maternity
leave, which in practice were only a leave from teaching. She admits
that it was tough at times. Her husband, she says, “looked after the
children as much as possible, so I could work. Together we make it work
by finding the cracks in the day”. They go to conferences as a family
and take turns looking after the children.

Academia offers flexibility, but it can still be a daunting place to start
a family, Weltman says. “When I was pregnant, I felt a little bit defensive
and guilty, like I was admitting that my personal life was important to
me by having a child,” she says. “In physics, you are supposed to be life,
blood, flesh, dedicated 100% to your research.” ■
COMMENT


WOMEN Quotas could 
overburden already- 
stretched science stars p.39 


WOMEN Europe must build on 
its impressive efforts to close 
the gender gap p.40 


WOMEN Why do biographers 
stereotype female 
scientists as weird? p.43 


TECHNOLOGY Two takes on
why the Internet is no
cure-all for social ills p.45


CODY PICKENS 


Neurobiologist Jennifer Raymond (front) and her colleagues in their lab at Stanford University in California. 


Most of us are biased 


Let’s move beyond denial, own up to our prejudices against women 
and retrain our brains to overcome them, says Jennifer Raymond. 


I have a bias against women in science.
Please don’t hold this against me. I am
a woman scientist, mentor and advocate
for women in science, and an associate
dean in my school’s Office of Diversity, with
a budding field biologist as a daughter. Yet
my performance on the Implicit Association
Test (https://implicit.harvard.edu/implicit/demo),
which measures unconscious associations
between concepts, revealed that I have
a tendency to associate men with science and
career, and women with liberal arts and family.
I didn’t even need to wait for my score;
I could feel that my responses were slower
and that I made more mistakes when I had
to group science words such as ‘astronomy’
with female words such as ‘wife’ rather than
male words such as ‘uncle’.

The results from hundreds of thousands 
of people indicate that I am not an outlier — 
70% of men and women across 34 countries 
view science as more male than female’. 

Gender bias is not just a problem in science. 

A host of studies shows that people tend 
to rate women as less competent than men 
across many domains, from musical abilities 
to leadership’, and that many individuals
hold biases about competency on the basis
of other irrelevant attributes, such as skin 
colour, body weight, religion, sexual orien- 
tation and parental status. 

Such biases have important consequences 
in the workplace. One study showed that 
mothers are 79% less likely to be hired 
and are offered US$11,000 less salary than 
women with no children’®. By contrast, the 
same study shows that parenthood confers 
an advantage to men in the workplace. 

BIAS BUSTERS

Ways to conquer gender assumptions

• Raise awareness of gender bias as a first
step to overcoming it. Call for transparency
in salaries, hiring, leadership and editorial
decisions. Organize a discussion of implicit
bias in science, and what can be done
locally to address it. At Stanford University,
we have a Gender Issues in Neuroscience
discussion group that brainstorms ways to
overcome challenges such as competition,
response to failure, networking and speech
and body language. We include men in these
discussions because they are also affected
by gender stereotypes and are an essential
part of the solution.

• Use gender-blind review or other
processes to mitigate bias when reviewing
applicants for a job, award, speaking
engagement, grant or manuscript.
Define measurable review criteria in
advance to avoid a gut response, which
is most vulnerable to bias. Be vigilant
for rationalizations that could reflect an
unconscious bias, such as “she’s great, but
seems awfully young/is not a good fit/is
working in such a competitive area”. Create
an environment in which it is acceptable
to question colleagues when bias might be
influencing their behaviour. It is easier to
detect bias in others than ourselves®, so we
need to help each other without judgement.
It is especially helpful if men initiate
conversations about gender bias so that
women don't bear the full responsibility.

• Make a conscious effort to offer women
mentoring and other support, including
an equal salary to male peers, to overcome
the documented tendency to offer women
less*. Trumpet the achievements of female
colleagues, because biases have the
greatest influence when there is a dearth of
specific information’.

• Women should overcome their own
gender bias because it could make them
less likely to compete for prestigious
jobs or awards*®. Be proactive in seeking
mentorship, and negotiate for salary
and other resources. Offer your talent to
employers who have programmes to help
level the playing field for women. Join or
start a women-in-science group, especially
within your own scientific subspeciality,
because such groups can provide speaking
invitations, tenure letters, advice about
the key issues and players in the field, and
reviews of papers and grants. For about
15 years, women in my subspeciality have
got together at our national neuroscience
meeting for an annual event that we
irreverently call the Babes of the Vestibular/
Oculomotor System Dinner.

• Fund pilot projects to test innovative
interventions to mitigate the effects of
bias, and create a central repository for
sharing strategies. These programmes
will more than pay for themselves if they
help to retain the best talent. Considerable
resources are being invested in training
each young scientist — if we want to be
good stewards of that investment, we need
to provide everyone, male and female, with
a fair shot at success. Institutions should
provide incentives, such as salary support or
alleviation of other duties, to individuals who
spearhead efforts to address implicit bias.


A 2012 study by Jo Handelsman of Yale
University in New Haven, Connecticut, and
her colleagues shook the scientific community
by reporting that science faculty members
have a pervasive bias against female
scientists*. This prevents us from doing
our job of promoting the best scientists,
and society is paying a price in terms of the
advancement of science.

There is now sufficient evidence to move
us beyond the denial phase of dealing with
gender bias. Yet in talking to colleagues
around the world, I find continued resistance
to the idea that scientists, who take
pride in being rational and objective, could
be influenced by bias. One colleague was
convinced that gender bias could affect the
hiring of a lab manager, but he still doubted
that it would affect a faculty-level hiring
decision or the evaluation of a manuscript,
even though the evidence suggests otherwise*.
And I have seen junior colleagues
shake their heads disapprovingly at the
gender bias of older science faculty members,
yet resist the idea that their generation
might also have such bias.

Unfortunately, young people are not
immune to gender bias. Many studies have
been conducted on college-age subjects, and 
gender bias has even been reported in pre- 
school children®. I tried to protect my own 
children from gender bias by doing things 
such as changing the gender of the charac- 
ters in the children’s books I read to them 
to reverse gender stereotypes, and using 
the feminine pronoun wherever possible 
— “Look at the elephant; she is so strong.” 
Despite these efforts, my daughter had a 
bias against women in leadership positions 
by the age of three. One day in the park, she 
announced, “I am the captain; I'm a girl cap- 
tain,” suggesting that she knew she had to 
violate a gender stereotype to assume that 
leadership position. And although she has a 
scientist mum who runs a lab full of women, 
when my daughter took the implicit associa- 
tion test at age 8, it revealed a bias against 
women in science. My presence as a role 
model and other efforts at countering gender 
stereotypes were not enough to overcome
the powerful cultural transmission of bias. 
Thus, it seems unlikely that unconscious 
gender bias will be eradicated any time soon, 
and the best we can do in the near term is to 
suppress its symptoms. 

If we are vigilant, we can reduce the influ- 
ence of bias on our decisions. Unconscious 
biases are mental habits that tend to domi- 
nate our gut reactions, but we also have 
more-rational decision processes, which 
compete with our biases for control of 
behaviour. Just as one can overcome physi- 
cal habits such as biting one’s fingernails 
or saying ‘umm’ when one speaks, one can
suppress undesirable mental habits such as
gender bias through deliberate, conscious
strategies (see ‘Ways to conquer gender
assumptions’). By enabling more women
to succeed, despite the existence of uncon- 
scious bias, this will gradually eliminate the 
stereotype of the successful scientist as male, 
which is the root of gender bias. 

However, if left unrecognized and 
unchecked, bias can commandeer both our 
behaviour and our rational thought processes. 
Our brains are skilful at creating seemingly 
rational justifications for our behaviour, even 
when it is driven by bias. People who had to 
rate two ‘applicants’ for police chief — one 
who had more education and the other who 
had more experience — always chose the man 
over the woman, but justified their choice as 
arising from the value they placed on either 
education or experience, whichever factor 
was assigned to the man’. 

Denial that bias exists gives it more power. 
I am not proud of my unconscious bias 
against women in science. However, I know 
that I must first recognize my own bias to 
overcome it with deliberate practices that 
suppress its effects. I urge you to join me. ■


Jennifer Raymond is associate professor 
of neurobiology and associate dean in the 
Office of Diversity and Leadership at the 
Stanford University School of Medicine, 
Stanford, California 94305-5125, USA. 
e-mail: jennifer.raymond@stanford.edu 
Scientists of the
world speak up 
for equality 


Eight experts give their prescriptions for 
measures that will help to close the gender 
gap in nations from China to Sweden. 




LIHADH AL-GAZALI 
Remove social 
barriers 


Clinical geneticist at the United Arab 
Emirates University in Al-Ain 


Just 1% of Saudi Arabia's researchers were 
women in 2011, according to the Inter- 
national Labour Organization. This low 
number is particularly surprising given 
that 65% of the nation’s bachelor’s science 
degrees go to women. Similar patterns are 
evident in the rest of the Arab Middle East. 
Women are clearly interested in science. But
many cannot continue their careers because
of limiting social attitudes in traditional
Arab societies.

The expected role for women — 
graduates included — is housewifery. In 
some areas, women must ask the permission 
of the men of the household even to leave 
their house. Conservative families may not 
allow their daughters to work in mixed-gen- 
der workplaces. To pursue advanced train- 
ing in research often requires postgraduate 
study elsewhere. If a woman wishes to do 
this, the household patriarch may mandate 
that a male family member accompanies her 
abroad. 

Despite these restrictions, the pool of 
highly qualified women scientists continues 
to grow in some Arabic countries. Accord- 
ing to the Organisation of Islamic Coopera- 
tion in Jeddah, Saudi Arabia, women now 
represent 19% of researchers in the occupied 
Palestinian territories and 22% in Libya. Few
of these women are university presidents,
directors or department heads.

Having more women in these positions 
would help to shift cultural expectations. 
Leading Arab women scientists should 
continue to get involved in the political 
life of their countries, where they can be 
strong advocates for other women scien- 
tists. Developments such as the inclusion 
of leading women academics in the Shura 
council, Saudi Arabia’s highest advisory 
council (which is now 20% female), and in 
the Federal National Council of the United 
Arab Emirates (22% female), are steps 
in the right direction. Highlighting suc- 
cess stories will encourage qualified Arab 
women to pursue careers in science. Role 
model and mentoring initiatives are also 
important. Examples of such programmes 
include the Stars of Science initiative by 
the Qatar Foundation for Education, 
Science and Community Development, and 
the TechGirls Exchange Program of the US 
Department of State. 

Universities and professional organiza- 
tions must help to educate the public about 
what science entails by inviting families to 
join conferences, careers days or network- 
ing events that include presentations of the 
achievements of women scientists. 


VIRGINIA VALIAN 
Invite women 
to talk 


Psychologist at Hunter College and 
the Graduate Center, City University 
of New York 


In 2003, I was invited to give the keynote 
speech at an event held annually by the Sigma 
Xi scientific-research society to honour scien- 
tists’ achievements. I was asked to speak about 
women in science. During dinner, I scanned a
list of the event’s previous speakers, from 1964
on, to count how many women had given the
keynote. Most were listed only by surname 
and first initial, so between courses, I walked 
around the room asking people whether they 
knew anyone on the list. Eventually, I found 
an older scientist who reviewed the list, rec- 
ognizing every name, then turned to me with 
a surprised, rueful smile: in nearly 40 years, I 
was the only woman to speak at this event. He 
had heard nearly every lecture but had never 
noticed that they were all given by men. 

At the beginning of my talk, I used that 
story as an example of how hard it would be 
for organizers and attendees to detect such a
pattern, given only one data point a year. The 
pattern isn't a product of discrimination or 
intentional exclusion of women. Rather, few 
people — men or women — think of women 
when they picture ‘top’ scientists who might 
headline an event. 

Are men still disproportionately featured at 
conferences? Determining an expected num- 
ber is hard, but it is still relatively rare to find 
women giving plenary or keynote speeches 
at conferences. At the American Chemi- 
cal Society's upcoming spring meeting, for 
example, all four of the planned plenary 
speakers are male. That doesn’t send an opti- 
mistic message to young female chemists. 
The blog Feminist Philosophers lists nearly 
20 recent philosophy conferences — many 
of which focus on science — featuring only 
male speakers. 

At a language-processing conference I 
attended recently, I went to 15 or so talks, 
making note of who asked questions in 
each one. Women were more likely to ask 
questions in sessions chaired by women, 
regardless of the speaker's gender. If that is 
a general pattern, bringing more women 
into prominent positions in conferences 
will increase women’s overall participation 
in scientific discourse. Many organizations 
strive to include scientists who are from non- 
English-speaking countries — the same can 
be done for women. 

To that end, my colleague Dan Sperber, 
a cognitive scientist at the Central Euro- 
pean University (CEU) in Budapest, 
and I have created an online petition 
(go.nature.com/sj4yed) whose signatories 
commit to accepting talk invitations only 
from conferences that have made good-faith 
efforts to include women. So far, we have 
more than 450 signatures — but few of them 
are from senior male scientists, and even 
fewer are from scientists in the United States. 

What counts as a good-faith effort? 
There is no single prescription, but Sperber 
and I have adopted a few suggestions from 
Feminist Philosophers. For one, organizers 
should seek out women in relevant fields to 
speak at conferences — and keep looking 
if the first woman they ask says no. Other 
examples include extending invitations 
early so that women have time to make 
arrangements, and offering child-care
services at meeting sites. 

But efforts should go beyond the indi- 
vidual. Conference funders should be 
mindful of gender equity for invited speak- 
ers. Similarly, universities should follow the 
example of the CEU, which now requires 
that organizers of university-funded events 
show good-faith efforts to include qualified 
female speakers (go.nature.com/ym81ws). 




BEN BARRES 
Allow time for 
postdoc babies 


Neurobiologist at Stanford University 
in California 


Three decades ago, when Nobel laureate 
Rosalyn Yalow spoke to a women in science 
group at a major university, her opening 
statement was: “The primary problem is 
childcare. Everything else is secondary.” 
Fortunately, many universities now rec- 
ognize the huge amount of time needed 
to raise children and help to offset this by 
providing an extension of 1 year to assistant 
professors who have a child while seeking 
tenure. Increasingly, however, scientists are 
having babies during graduate or postdoc- 
toral training, which together can now total 
10 years or more. 

I propose that universities modify their 
tenure-clock extension rules to cover chil- 
dren born at any stage in a career. So even 
if people already have children when start- 
ing out as assistant professors, they should 
be offered an additional year per child (up 
to two children, perhaps) to obtain tenure. 
Even though women typically provide more 
of the child care than do men, it seems only 
fair to provide the same extension to men. In 
the modern world, both parents often work 
and like to eat dinner with their children 
(and so cannot write grant applications and 
papers until midnight).

This change would remove a persistent, if
unintentional, form of discrimination that 
deters many young scientists, particularly 
women, from choosing tenure-track jobs, 
and would increase the chance that those 
who do will make tenure. At Stanford, our 
provost routinely grants tenure-clock exten- 
sions when requested. Why not just make it 
the rule at every university? 

The main argument against extending 
tenure clocks is that it might help men more 
than women, because men may use the extra 
time to amass papers whereas women use it 
to raise children. If so, one could argue that 
this is also true of the tenure-clock exten- 
sions already granted. Yet most agree that 
these have been beneficial to all involved. If 
universities wish to achieve a more diverse 
faculty, we must continue to remove obvi- 
ously discriminatory policies. 


LING-AN WU 
Equalize the 
retirement age 


Physicist at the Institute of Physics, 
Chinese Academy of Sciences, 
Beijing, China 


There are two things that China can and 
should do to make it easier for women to 
succeed in science: enforce laws that grant 
equal opportunities to women in the work- 
place, and make the retirement age the same 
for men and women. 

In some ways, it is harder to be a woman in
science in China today than it was 50 years
ago, before the Cultural Revolution. Then, 
under the socialist system, men and women 
were given jobs based purely on their per- 
formance, so sex ratios were relatively fair. 
Now that more-capitalist principles infiltrate 
job placement, it has become more difficult 
for women to find work. I benefited from 
the socialist system. After working on a farm 
for three years, I was assigned a job doing 
translations at Beijing's Institute of Physics, 
Chinese Academy of Sciences, because I was 
fluent in English. This enabled me to go to 
the United States and obtain a PhD in phys- 
ics. On returning to the institute with my 
new skill set, I became a researcher. 

Now, women face discriminatory prac- 
tices that make it harder for them to succeed. 
For example, in most Chinese institutions, 
women who are not full professors are 
required to retire by age 55, 5 years before 
men. The earlier retirement age was origi- 
nally established to protect women perform- 
ing manual labour. In science it essentially 
prevents them from reaching the same career 
goals as men, particularly if they had to take 
time off to raise children. In the past, this 
practice was not compulsory for academia. 
It was put into effect at the turn of the twenty- 
first century to open up positions for young 
returnees from abroad, who were mostly 
men. This led to a fast drop in the propor- 
tion of women holding lab or department 
directorships — for example, at my institute, 
that percentage fell from around 20% before 
the 1990s to 6% in 2003. (The proportion of 
female associate scientists has risen to the 
original level of 27%, but they still make up 
only 13% of the full-scientist ranks.) 

Furthermore, bias now plays a major 
part in job recruitment. Even in academia, 
I frequently hear faculty members — even 
women — saying that they would prefer 
to hire male students. Private companies 
advertise for men only, or decline to inter- 
view female candidates. Although a 1995 law 
prohibits discrimination by employers based 
on gender in China, this is often completely 
disregarded. 

Still, the country has made progress. In 
2011, the National Natural Science Foun- 
dation of China, which oversees the larg- 
est source of government funding for 
fundamental research, raised the age of its 
young investigators’ awards from 35 to 40 
for women, to give them time off to raise 
children. This allows women to compete 
more fairly with their male counterparts, 
especially important because the number of 
day-care centres has plummeted owing to 
rising costs. Today, Western and old feudal 
perceptions spread by the media have led to 
the saying: “It is more worthwhile to find a 
good husband than a good job.” But no one 
says: “It is more worthwhile to find a good 
wife than a good job.” This saddens me.


EVA Y. ANDREI
Inspire our 
daughters 


Physicist at Rutgers University in 
New Jersey 


As a female physicist, I am a ‘rare bird’ — a
member of a tiny minority, scarcer than in 
any other field of science and engineering. 
Women's representation in physics at major 
US research universities hovers at about 
13%, for many reasons. In my view, these 
demographics alone make it hard to find 
female plenary speakers, conference organ- 
izers and journal reviewers. 

It would be counterproductive to restore 
the gender balance by burdening the 13% 
with more refereeing or committee work. 
Instead, the solution needs to be sought 
through inspiring and mentoring high- 
school girls and undergraduate students. 

We must also ask whether girls are less 
inclined towards physics than boys. Is there 
something in the discipline or its culture that 
turns them away? Can this be changed? 

I am somewhat heartened to see that the
fraction of physics PhDs earned by women 
has increased from 2% in 1966 to 18% in 
2010. The percentage of women faculty mem- 
bers at every rank matches the numbers who 
graduated in the respective years. Still, we are 
not there yet, and the decline since 2002 in the 
number of women earning a bachelor’s degree 
in physics is a further cause for concern. 

When I polled my women colleagues, 
almost all agreed that a dearth of guidance 
and mentorship early on was the main rea- 
son for the lack of female physicists. 

The rising tide of women joining the 
profession will encourage more young 
women into physics. In the meantime, we 
should showcase successful female physi- 
cists through lectures and prizes, and retain 
young scientists by offering travel bursaries 
and by stopping tenure clocks. But to really 
solve the problem, we must inspire our high- 
school daughters.


JO HANDELSMAN &
CORINNE MOSS-RACUSIN 
Institute training 
to reduce bias 


Microbiologist, and social 
psychologist, Yale University, 
New Haven, Connecticut 


In 2012, we published a study showing that 
scientists of all ranks and both genders are 
more likely to hire, mentor and pay more to 
a ‘John’ than a ‘Jennifer’. Soon afterwards, we
were contacted by the office of US congress- 
woman Louise Slaughter about ways that 
policy-makers might help to promote gender 
equity in science. 

We proposed that the ethics training now 
required for students funded by grants 
from the US National Institutes of Health 
be expanded to include gender-bias train- 
ing. Indeed, we feel that all scientists should 
go through such training. It has been 
shown that students who undergo diver- 
sity training score markedly lower on tests 
of implicit bias than students who do not 
(L. A. Rudman et al. J. Pers. Soc. Psychol. 81, 
856-868; 2001). 

Other approaches can also make a dif- 
ference — the board game Wages, designed 
by Stephanie Shields, a psychologist at 
Pennsylvania State University in University 
Park, and her colleagues, shows players how 
subtle disadvantages to women can have 
enormous cumulative impact on their careers. 

Visual priming works too — for example, 
people who have viewed images of disliked 
white people and admired African Ameri- 
cans within the past 24 hours are less likely 
to show automatic pro-white attitudes 
(N. Dasgupta & A. G. Greenwald J. Pers. Soc. 
Psychol. 81, 800-814; 2001).

For more sustained effects, we could
paint murals of admired female scientists 
throughout the halls of universities. A 
large image of, say, Rosalind Franklin, 
viewed daily by students in an introduc- 
tory molecular-biology class, might be 
even more powerful than explicit instruc- 
tion on implicit bias. Scientists have agreed 
on standards and training to ensure proper 
treatment of animal and human research 
subjects. The people doing the research 
are just as important. Striving for equality 
should be a core aspect of being a scientist.


LIISA HUSU 
Recognize hidden 
roadblocks 


Professor of gender studies at the 
GEXcel Centre of Gender Excellence, 
Örebro University, Sweden


In researching women in science and aca- 
demia, I have found that it is not only the 
things that happen to women — such as 
recruitment discrimination or belittling 
remarks — that affect them in pursuing a 
career in science or that slow their career
development. It is also the things that 
do not happen: what I call ‘non-events’ 
(L. Husu Adv. Gender Res. 9, 161-199; 
2005). 

Non-events are about not being seen, 
heard, supported, encouraged, taken into 
account, validated, invited, included, wel- 
comed, greeted or simply asked along. They 
are a powerful way to subtly discourage, 
sideline or exclude women from science. 
A single non-event — for example, failing 
to cite a relevant report from a female col- 
league — might seem almost harmless. But 
the accumulation of such slights over time 
can have a deep impact. 

Non-events can be manifold. Superi- 
ors or colleagues might ignore or bypass 
women’s research and performance; fail 
to invite or welcome them to important 
informal and formal networks; bypass them 
for awards, prizes or invitations; fail to
give them merit-advancing tasks such
as representing the research group in
public forums; not ask them to design or
participate in scientific meetings, con-
ferences, panels or
as keynote speakers; or simply stay silent 
when it comes to career support, advice 
and mentoring. Even supposedly small 
non-events can send a powerful message, 
such as when a female postdoc publishes 
a high-profile article that generates no 
reaction from senior local colleagues, while 
her male counterpart’s parallel article is cel- 
ebrated with high-fives all round. 

Non-events are challenging to recognize 
and often difficult to respond to. Nothing 
happened, so why the fuss? Often, non- 
events are perceived only in hindsight or 
when comparing experiences with peers. 
Learning to recognize various non-events 
would help women scientists to respond 
to them, individually or collectively, with 
confidence and without embarrassment. 
Anonymous pooling of non-event experi- 
ences would be an eye-opener and a good 
start to understanding how non-events 
work in various scientific settings. 

All scientists — leaders, gatekeepers, 
rank and file — need to be aware of how 
they might inadvertently exclude women 
from crucial collegiality. Monitoring the 
practices of support, encouragement, 
inclusion and exclusion in research 
groups, projects, networks, conferences 
and science institutions from a gender 
perspective would be a first step forward. 
Addressing this issue in management 
and supervisor training and early-career 
coaching is key.


Quotas are questionable


Measures to give women a fair chance in science should be based on evidence, 
warns Isabelle Vernos, or they could make matters worse. 


In Europe, only 36% of mid-ranking
professors, and 18% of full professors, are
women, despite equal proportions of men
and women at the undergraduate level’. To
address the problem of gender imbal-
ance, the European Commission
(EC) has committed to reaching 40%
female participation in its advisory
structures for Horizon 2020, the
European Union's research-funding
programme for 2014-20. The EC has
also proposed a mandatory quota of
40% for women on non-executive
boards of public companies.

But statistics collected by the
European Research Council (ERC)
suggest that quotas are no magic
wand to bring about gender equal-
ity in research and academia (despite
tentative successes elsewhere, such
as for company boards in Norway).
Quotas might even make matters
worse by overworking already-
stretched female scientists. Instead,
a range of bottom-up and top-down
measures are needed to effect lasting
change in the structures and culture
of science.




THE ERC EXPERIENCE 

The ERC, launched in 2007, provides up to 
five years of funding for scientific-research 
projects chosen by peer reviewers through 
a transparent competition system. So far, 
women make up a disheartening 19% of the 
ERC’s 3,500 grantees and account for just 25% 
of the nearly 35,000 applications received by 
the ERC to date; 29% of applicants for early- 
career grants, and only 15% of those for 
advanced grants, are female. 

In 2010, the ERC implemented some 
recommendations — made by its gender- 
balance working group (GBWG) under the 
excellent leadership of outgoing chairwoman 
Teresa Lago — such as increasing the window
of grant eligibility for applicants who have 
children (the US National Science Founda- 
tion adopted similar measures in 2011). The 
number of female applicants for ERC grants 
has increased, but so has the number of male 
applicants — the gap has not narrowed. 

Another concern is the lower success rate 
of female applicants for ERC grants: 10% on 
average, versus 12% for men. This general 
trend has been observed in other funding 
schemes, including the European Molecular 
Biology Organization’ and the international
Human Frontier Science Program’. Despite
several studies on the issue, the reasons
for these disparities are still elusive**. The
dearth of women in academia’s upper ranks
translates into their scarcity on committees,
and it has been argued that this gender imbal-
ance could be feeding back into the lower
success rate of female applicants for funding
schemes and positions.

I do not think this is so — at least not at the
ERC. We have found no correlation between
the success rates of female applicants and
the gender balance of evaluation panels
(see ‘Grant gap’). Nor have we found that
female applicants are more successful when
the panels are chaired by women. Other stud-
ies have found that women fare worse than
men in evaluations‘, even when applicant
gender is undisclosed to evaluators’. These
findings suggest that a quota system for staff-
ing evaluation panels will not lead to more
grants for women.

[Figure: ‘Grant gap’. Aggregating data for 2008-12, the European Research Council found no correlation between the percentage of women on its evaluation panels and the success rate of female applicants. Axes: share of female experts on panels (%) against difference in success rates (%), indicating whether men or women were more successful. Series: physical and engineering sciences; life sciences; social sciences.]

Worse, quotas would place greater demand
on the small pool of female scientists who
would serve on these panels — possibly
enough to hamper their career progress.
Scientists are evaluated on the basis of
creativity and productivity, and scientific
excellence requires steely focus and lots of
time. Female scientists are more likely than
males to bear domestic duties””, making
their time already stretched. My esti-
mate, based on the proportion of ERC
grantees who are women, is that the
burden of panel participation would
be three or more times higher for these
women than for men in equivalent
positions. That said, concerted efforts
should be made to identify qualified
women for all posts.

Given that measures taken to date
have had limited impact, some argue
that mandatory quotas are the best
way to accelerate gender balance in
research (see page 42). I disagree.
There are many pockets of good
practice — such as mentorship pro-
grammes, family-friendly policies and
transparency in recruitment — and
these should be expanded. Further
solutions must also be sought. The
GBWG has commissioned a study,
called ERCAREER, to examine the
career paths of young female and male 
scientists. Another study, to begin this 
year, will look into possible sources of 
gender bias in the ERC’s evaluation processes. 
Progress towards improving women’s rep- 
resentation in science is too slow. More efforts 
are needed to understand the reasons for 
gender disparities and to recruit, retain and 
promote excellent female scientists — and 
thus increase the research talent pool and 
power scientific progress. Europe's future 
success requires a society that recognizes 
talent and offers equal opportunity to all — 
through evidence-based measures.


Isabelle Vernos is chair of the ERC GBWG 
and is ICREA Research Professor at the 
Centre for Genomic Regulation, 08003 
Barcelona, Spain.


[Photo: At the European Parliament in Brussels, the European Platform of Women Scientists is calling for binding gender targets.]


Only wholesale reform 
will bring equality 


Providing equal opportunities for women in science requires change at 
every level, argue Brigitte Mühlenbruch and Maren A. Jochimsen.


Gender-equality policy in science at
European, regional and national level
has come a long way, thanks to more
than 20 years of efforts by women scientists 
and far-sighted politicians. Yet progress is 
slow. The pipeline is still leaking and female 
researchers still hit glass ceilings. 

In 2007, in the 27 countries that comprise 
the European Union (EU), women scientists 
accounted for 38% of active researchers and 
only 19% of full professors, on average’. In 
2009, 45% of doctorates were awarded to 
female students’. Although the number of 
women PhD graduates is growing — up by 
an average of 4.9% per year between 2004 
and 2009, compared with 3.2% for men’ —
this is not enough to suggest that science’s 
gender imbalance is self-correcting.

When it comes to women in decision-mak- 
ing positions in science and research, Europe 
is a long way from its 2001 target of 40%. With
the exception of Sweden, Finland and Nor- 
way, women in Europe are still significantly 
under-represented on the boards of research 
institutions, funding organizations, scientific 
councils and academies, and are rarely found 
among the heads of higher-education institu- 
tions in the majority of European countries”. 
In short, Europe could do better.

The persistent gender gap has prompted
great changes in equal-opportunities strat- 
egies at European and member-state level, 
particularly since the launch of the EU’s 
Women and Science activities in 1998. At 
first, policy concentrated on individual pro- 
grammes to equip women scientists with the 
necessary soft skills to advance, such as net- 
working, mentoring, stipends, training and 
the provision of role models. These remain 
indispensable instruments for encouraging 
individuals. 

However, impediments to women scien- 
tists have deep institutional roots. The lack 
of full female participation in academic 
careers is often a systemic consequence 
of the culture and organization of higher-
education institutions. Consequently,
attention is shifting in the EU to encourag- 
ing structural changes in research organiza- 
tions. Measures include increasing diversity 
in recruitment; introducing promotion and 
retention policies; updating management 
and research-assessment standards; devel- 
oping course content to successfully attract 
women as well as men; policies for dual- 
career couples; and schemes that allow 
women to return to work after career breaks’. 
To achieve lasting equality, science needs a 
culture that is sensitive to gender and diver- 
sity in all its endeavours: individual and 
social, structural, institutional and political. 
We need transparency, accountability and 
monitoring in decision-making, evaluation, 
recruitment, attribution and funding. We 
need to secure the interest and collaboration 
of highly qualified women and men by offer- 
ing predictable academic careers, attractive 
working places and conditions that enable 
work and life to be reconciled’. 
Furthermore, we need to agree that gen- 
der is indispensable to research itself. For 
example, given that there are important 
sex differences in responses to many drugs 
and therapies, the underrepresentation of 
women in clinical trials must be fixed. This 
applies to animal studies too: laboratory 
experiments in mice predominantly use 
male animals, limiting what can be inferred 
from findings. Gender should be addressed 
from proposals to papers and beyond’. 


BEST PRACTICE 

Changing the academic culture will take 
a mixture of voluntary commitments and 
binding regulations — all backed up by 
funding. Here we describe some examples 
of measures that are having good effects and 
that should be replicated elsewhere. 

In 2008, the member organizations of 
the German Research Foundation (DFG; 
Germany’s largest research-funding body) 
committed to a set of structural and per- 
sonal guidelines called Research-Oriented 
Standards on Gender Equality. These call 
on member institutions to make gender 
equality integral to management, human 
resources, organizational development, 
strategy and content, resource allocation 
and quality-assurance procedures. The 
standards also demand that institutions 
publish data on gender equality at all organ- 
izational levels and academic career stages. 
The guidelines require that institutions 
design procedures in a transparent, struc- 
tured and formal manner, that they counter 
outdated gender stereotypes, accommodate 
individual life plans and empower men and 
women to combine family life and academic 
careers. 

Responsibility for implementing the
standards lies with each member institution,
and the DFG uses incentives and evaluation
reports to ensure adherence. To help, the
foundation also provides an online toolbox
of practical examples (see go.nature.com/dojcsz;
in German). Implementing the
standards is voluntary but is a condition of
DFG funding.

These standards have led to progress. For 
example, most DFG member institutions 
now have flexible working schedules, child- 
care facilities and other family services. Most 
member universities have implemented 
transparent, structured and formalized
procedures for the unbiased evaluation of
scientific quality and the hiring of profes-
sors. Other positive developments include
the endeavour to replace stipends, especially 
PhD grants, with employment contracts that 
include social-security benefits. 

The University of Duisburg-Essen, 
where one of us (M.A.J.) manages the Essen 
College of Gender Studies, is particu- 
larly notable for its pro-women activities. 
These include a university-wide mentoring 
system; a nationwide network on university 
course development and teaching for shar- 
ing expertise in gender issues; and an online 
portal of gender information for staff. The 
institution even has a vice-rector for diver- 
sity management — the first post of its kind 
at a German university. 

At European level, the European 
Research Council (ERC) published a 
gender-equality plan in 2011 (ref. 6). Each 
process within the ERC — from advertis- 
ing to grant signing — is designed to give 
equal opportunities to men and women 
(see page 39). Among other measures, the 
ERC sets goals for and monitors the gender 
balance of its panels of peer reviewers on the
basis of information from relevant scientific
communities and its pool of applicants. If a
goal is not reached this must be reported, 
with an analysis of how the situation can be 
improved’. The ERC also commits to chal- 
lenging potential sources of gender bias in 
the evaluation process, for example by pub- 
lishing men and women’s submission rates, 
success rates and granted amounts. The 
council offers gender-equality training to 
ERC scientific officers and discusses gender 
awareness with evaluation panels, highlight- 
ing how they should evaluate career breaks 
and unconventional research career paths®. 


MEASURES OF MERIT 

Other gratifying recent measures include 
a 2011 recommendation by the European 
Science Foundation (ESF) for its member 
organizations to attain a gender ratio of 
at least 40% women among grant review- 
ers. When selecting experts, the ESF also 
encourages the consideration of individ- 
ual non-standard career paths affected by 
changes or interruptions caused by profes- 
sional mobility or family reasons’. Among 
university associations at European level, the 
League of European Research Universities 
took the lead in mapping out what universi- 
ties and their affiliated institutions can do to 
bring about change’. 

Germany's Programme for Women Pro- 
fessors, launched in 2007, is an exemplary 
blend of state prescription and voluntary 
institutional commitment. The programme 
funds universities for appointing women 
to the rank of full tenured professor. To 
be selected, a university must produce a 
coherent gender-equality plan. So far, the 
programme has led to more than 260 new 
female professorships at 109 universities 
(see go.nature.com/imunpf; in German). It
has also funded measures such as increasing
the number of women in decision-making
positions, providing career development for
young female researchers and boosting the
proportion of women in disciplines that have
low female participation, such as engineer-
ing, computer science and physics.

[Photo: A chemistry student at Germany’s University of Duisburg-Essen, notable for its pro-women activities.]

Although voluntary targets can achieve 
much, binding regulations are the only way 
to effect change in some cases. Quotas, as 
contested as they are, are another way to 
counter the under-representation of
women scientists in decision-making
positions in research organizations. In the
Nordic countries and Austria, for example,
quotas of at least 40% of each gender are
mandated in the administrative parts of 
research organizations. 

From this year, the German Leibniz Asso- 
ciation, a high-profile umbrella organization 
of 86 non-university research institutions, 
has become one of the first research organi- 
zations to introduce binding, merit-based 
quotas to encourage equal opportunities. 
The quotas use a ‘cascade model’: each level 
of university hierarchy in each discipline 
must, by 2017, reach at least the same pro- 
portion of women as is present at the level 
below. Such flexible quotas are sensitive to 
varying numbers of men and women in dif- 
ferent scientific disciplines. 
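
The cascade rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the percentages are hypothetical, and it is not the Leibniz Association's actual procedure. Each career level's target is simply the current share of women one level below it.

```python
from typing import List, Optional


def cascade_targets(shares: List[int]) -> List[Optional[int]]:
    """Return the cascade target for each level, lowest level first.

    `shares` holds the percentage of women at each level (hypothetical data).
    The lowest level has no level below it, so its target is None.
    """
    return [None] + shares[:-1]


def shortfalls(shares: List[int]) -> List[int]:
    """Percentage points by which each level falls short of its target."""
    gaps = []
    for share, target in zip(shares, cascade_targets(shares)):
        # A level that already meets or exceeds its target has no shortfall.
        gaps.append(0 if target is None else max(0, target - share))
    return gaps


# Hypothetical shares of women: doctoral, postdoc, group leader, director.
example = [50, 40, 25, 10]
print(cascade_targets(example))
print(shortfalls(example))
```

Because each target is read from the level below within the same discipline, a field with few women at entry level gets lower targets further up than a field with many, which is the flexibility the text refers to.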


BEYOND ACADEMIA 
The structural measures outlined above 
should be applied to other leading institu- 
tions, such as academic publishers, which 
should publicize the number of female 
editors and reviewers (see page 47). More 
women scientists should be invited to write 
editorials, reviews and survey articles. Jour- 
nals and funding agencies should man- 
date that researchers account for gender 
in experiments and that they disaggregate 
gender data in all submitted and accepted 
papers where relevant, and in clinical tri- 
als and cohort studies as a matter of rou- 
tine. The Canadian Medical Association 
Journal and the Journal of the American 
College of Cardiology already follow such 
good practice, and Nature and The Lancet 
are considering adopting similar policies.

Against this background, the European 
Platform of Women Scientists, which repre- 
sents more than 12,000 female researchers 
in Europe and beyond, has urged the Euro- 
pean Commission (EC) to do six things with 
respect to Horizon 2020, the upcoming EU 
Framework Programme for Research and 
Innovation.

First, the EC needs to introduce binding
gender-evaluation criteria in EU
research-funding programmes. Second, it
should set targets for the participation of 
women in EU-funded research projects at 
all levels (young and senior scientists, pro- 
ject leaders, consortium managers) at the 
proposal stage, sanctioning missed targets 
and publishing results. Third, the EC needs 
to set indicators for gender sensitivity in 
the research design of proposed projects. 
Fourth, it should extend the 40% target for 
women’s participation beyond advisory 
groups and evaluators’ panels to all struc- 
tures related to Horizon 2020 — including 
the ERC, Joint Research Centre, European 
Institute of Innovation and Technology, 
steering and expert groups and the like. 
Fifth, it needs to train evaluators in gender 
issues. Finally, the EC must increase funding 
for research into improving societal struc- 
tures as part of its innovation strategy (see 
go.nature.com/y7vygb). 

Motivation and participation are the 
basis of high-quality results in research — 
not biased evaluation criteria, job insecu- 
rity and glass ceilings. An academic culture 
that is transparent, democratic and sensi- 
tive to gender and diversity will benefit all 
scientists. Much has been achieved; a lot 
remains to be done.


Weird sisters?


Biographies of female scientists perpetuate stereotypes, laments Patricia Fara. 


James Watson was thrilled to catch an
illicit glimpse of Rosalind Franklin’s
X-ray photograph suggesting the
double-helical structure of DNA. He was much
less impressed by her personal appearance. 
“There was never lipstick to contrast with 
her straight black hair,” he wrote in The 
Double Helix, “while at the age of thirty- 
one her dresses showed all the imagination 
of English blue-stocking adolescents.” Had 
he been more aware of European fashion, 
he might have appreciated the care Frank- 
lin took to adopt designer Christian Dior’s 
iconic 1947 New Look — although she 
presumably never revealed to him that her 
underwear was hand-made from parachute 
silk to her own specifications. 

Watson was far from alone in believing 
that it is impossible for someone to be both 
a normal woman and a first-class scientist.
To safeguard her reputation as a serious 
researcher, PhD student Jocelyn Bell — who 
discovered pulsars in 1967 — removed her 
engagement ring every morning before 
she went into the laboratory. Decades 
later, when president of Britain’s Royal
Astronomical Society, she complained that
“As a woman in physics, you certainly need 
to be a superwoman”. 

In the past, biographers and their publish- 
ers routinely squeezed female scientists into 
stereotypical roles — the frump, the whore, 
the enchantress, the underdog or the power 
behind the throne. Even Brenda Maddox, 
who criticizes Watson for his chauvinistic 
attitudes, played on gender stereotypes in 
choosing the subtitle The Dark Lady of DNA 
for her biography of Franklin. Is it not suf- 
ficiently fascinating that Franklin’s skilled 
research was crucial for Watson's fame? 

Current writers, male and female, are 
keen to distance themselves from old-fashioned
approaches. Still, to boost their book’s
appeal, they emphasize the singularity of
their subjects. It seems that being an ordinary
woman with a stellar scientific career
is simply not enough: to be marketable, she
must also be odd. Dust jackets entice purchasers
by rebranding an overlooked character
as a unique female individual — in other
words, as a weird woman.


Converting female scientists into publishing
opportunities may sell books, but it
does the cause of equality in science no
favours. Take the recent biography of Marie
Tharp, the American geologist and carto- 
grapher who, with colleague Bruce Heezen, 
produced the first systematic map of the 
ocean floor in 1977. In Soundings, author 
Hali Felt imagines Tharp walking along the 
streets of New York, her coat unbuttoned, 
shoes scuffed and frizzy hair unbrushed. 
“She does not look like the other women,”
Felt writes — reinforcing sweeping generalizations
that female scientists are a race apart.

Even sympathetic authors perpetuate 
the prejudice that brains and beauty never 
go together. Describing the film star Hedy 
Lamarr as The Most Beautiful Woman in 
the World, Richard Rhodes deliberately 
provokes a shiver of surprise by reporting 
that she also made breakthrough inventions. 
With composer George Antheil, she devised 
spread-spectrum radio, a technology now 
used in many applications, including cord- 
less phones. “Any girl can be glamorous,” 
Lamarr is reported to have said. “All you 
have to do is stand still and look stupid.”
Despite her intelligence, Lamarr’s remark- 
able looks and glamorous career occluded 
her innovations for decades. 

Science’s most famous heroine is surely 
Marie Curie. Over the decades, biographers 
have caricatured her as various unrealistic 
and undesirable ciphers, most notably the 
adulterous opportunist and the martyr to 
science. When her husband was killed — 
according to some such romances — she 
ensured her continuing success by latch- 
ing on to his married colleague, Paul Lan- 
gevin. Meanwhile, eulogies of the pioneer 
who dared to behave differently stress that 
she was often too absorbed in her work to 
eat, sacrificing her health as well as her 
appearance to the higher cause of 
research. Downplaying Curie’s 
theoretical achievements, they 
portray her as a dedicated worker 
who spent months systematically 
sieving tonnes of pitchblende —
a mindless, repetitive task with
echoes of domestic drudgery.

Modern biographers may have 
abandoned such facile renderings, 
but they behave as if it were unthink- 
able to criticize an icon. Thus, in 
Marie Curie and Her Daughters, 
Shelley Emling presents the 
physicist as a doting mother to 
Eve and Irène, even though the
evidence suggests otherwise. 
Birthday after birthday, Curie 
chose to be away, sending let- 
ters that overflowed with love 
and regrets, yet enclosed extra 
homework. Eve reported that her 
parents regarded radium as their
third child, and they seem to have treated
their human offspring as an experimental
research project. Marie remained physically
and emotionally remote, while meticulously
recording the girls’ clothes, diet and academic
progress in her notebooks.

Dorothy Wrinch with her protein model.

A less familiar name is that of mathema- 
tician Dorothy Wrinch, often labelled a 
harridan because of her forceful manner 
among her male peers. Like Curie, she was 
censured for behaving like a man — with
ruthless ambition. The first woman to 
receive a doctor of science degree from the 
University of Oxford, UK, Wrinch devel- 
oped a theory about the molecular structure 
of proteins that, although later discredited, 
ultimately contributed to genetics. 

In contrast with Felt’s book about Tharp, 
Wrinch’s name does at least appear on the 
cover of Marjorie Senechal’s biography, 
although relegated to the subtitle in favour 
of the eye-catching but misleading I Died for 
Beauty. Wrinch was indeed fascinated by the 
aesthetic appeal of mathematical truths, so 
the title and the pink cover motif implicitly 
trivialize her intellectual abilities. 

Another infuriating feature of many 
biographies about women scientists 
is their use of first names and gushing 
prose. Senechal adopts the nickname 
‘Dot’ for Wrinch — but presumably she 
would not have converted Michael Fara- 
day into Mike, or Albert Einstein into Al. 
Similarly, although Maddox writes in an 
accessible yet dignified style, she insists 
on calling Franklin ‘Rosalind’. So why 
does she use surnames for Watson and 
his male colleagues? Presumably, this 
patronizing practice is intended to foster an 
impression of cosy familiarity, but it signals
once again that women who go into
science are venturing into an alien
male world.

In bids to restore women
scientists’ reputations, fulsomeness
over their work’s
significance is as distorting as
demeaning familiarity. True
equality cannot be attained
until women can accept
criticism without taking the easy route of
complaining about gender bias. Ideas are
often rejected simply because they are not
good enough: the US Navy may have been
justified in dismissing Lamarr’s projected
guidance system as too bulky to be valuable.
And failing to win a Nobel prize need not
mean that a woman is a wronged genius:
Franklin’s X-ray photograph proved crucial
in the race to find the structure of DNA, but
Crick and Watson did get there before her.

Hedy Lamarr co-invented spread-spectrum radio.

By perpetuating stereotypes, books affect 
how people think. When I was in my early 
twenties, I resolved never to confess that I 
had a degree in physics from the University 
of Oxford: I knew from experience that any 
potential suitor would immediately assume 
I slotted into one or other of the ‘strange 
woman scientist’ categories. And schoolgirls 
are still being steered, as I was, into math- 
ematical and technical subjects by teachers 
serving the cause of political correctness. I 
migrated immediately after graduating not 
because I was incapable of tackling phys- 
ics or because I was intimidated by being 
in an environment dominated by men, but 
because I was bored by the repetitive practi- 
cal work. 

Biographers can shift attitudes, but they 
need to celebrate their subjects for being 
special scientists, not marvel at them as 
weird women. Just like men, female sci- 
entists have individual personalities and 
idiosyncrasies, and they have weaknesses 
as well as extraordinary capabilities — not 
because they are women, but because they 
are human beings.


Patricia Fara is a historian of science at the 
University of Cambridge, UK. 
e-mail: pf10006@cam.ac.uk


Techno-fix troubles


Two analyses challenge the idea of the Internet as a 
panacea for social ills, finds Nicholas Carr. 


Engines of Culture (Transaction), social
scientist Daniel Fox lamented the rise
of “technocratic solutionism”. Frustrated
by the messiness of politics, intellectuals 
were retreating to a simplistic view of social 
progress, predicated on a belief that “prob- 
lems have technical solutions even if they 
are the result of conflicts about ideas, values 
and interests”. In technology’s promise of 
the quick fix, disheartened thinkers found 
comfort. 

Some 20 years on, the appeal of solution- 
ism is stronger than ever, thanks to rapid 
advances in the analytical and communi- 
cative powers of computers. The hopes of 
today’s solutionists centre on the Internet. 
In its decentralized, ‘peer-to-peer’ architec- 
ture, they see a model for a more democratic 
polity. And in its bulging databases, they see 
a digital Rosetta Stone that, once decoded, 
will allow us to decipher the causes of social 
ills from obesity to government corrup- 
tion. If we can just get the algorithms right, 
the thinking goes, we'll be able to solve our 
most intractable problems in an illuminat- 
ing burst of statistical analysis. 

The Internet has been around for long 
enough to put its curative powers to the 
critical test. Two authors — social theorist 
Evgeny Morozov and computer scientist 
Jaron Lanier — argue independently that 
the Net is too blunt an instrument to solve 
complex societal problems. Far from being 
a cure-all, the network actually aggravates 
some maladies, such as the concentration 
of economic power, that many assumed it 
would remedy. 

In To Save Everything, Click Here, 
Morozov provides an astute, if sometimes 
shrill, critique of contemporary solution- 
ists and their reductive assumptions. He 
describes the way in which “Internet cen- 
trism” has skewed our discussions of eve- 
rything from law enforcement to public 
health. It may be tempting, he writes, to 
recast complicated cultural and political 
phenomena as “transparent and self-evi- 
dent processes that can be easily optimized”, 

but this usually cul-
bromides: embrace “openness”, “sharing”
and “virality”, let information do its thing, 
and our problems will solve themselves. 

Morozov points to the rise of ‘crowd- 
funding’ as an example. Online exchanges 
such as Kickstarter broaden the reach of 
venture capitalism by allowing people to 
make small investments in commercial and 
creative projects being undertaken by indi- 
viduals and small businesses. The US singer 
Amanda Palmer, for example, raised

and an open marketplace is intoxicating to today’s efficiency-
minded technophiles, who have been quick 
to promote such sites as replacements for 
cash-strapped arts councils. 

But, Morozov contends, crowd-funding 
is in thrall to the herd instinct. It funnels 
money towards endeavours that generate 
buzz rather than demonstrate merit, and it 
encourages artists to act as marketers and 
hucksters. He points to a recent study of 
documentary film-making in Britain that 
suggests that online contributors tend to 
concentrate their money in polemical fea- 
tures that promote a fashionable “activist 
agenda”. They are much less likely to back 
documentaries that seek to explore conten- 
tious issues objectively and in depth. Crowd- 
funding is a solution only if you misjudge 
the problem. 

Although Morozov is right to stress how 
technological determinism can warp politi- 
cal debates, he ends up going too far in the 
opposite direction. He claims that “the Inter- 
net” — his quotation marks — is largely a 
rhetorical construct, a sort of popular myth, 
and that it lacks any inherent qualities that 
might shape the behaviour of its users.
Digital technologies, he asserts, “are not the
causes of the world we live in but rather its 
consequences”. This is a naive view of large- 
scale networks, and it lets Morozov sidestep 
difficult questions about the way the Net, 
like the highway system and the electric grid 
before it, moulds our economy and culture 
in its own image. 

Lanier offers a more searching examina- 
tion of the Internet’s defects in Who Owns 
the Future?. The Net’s workings, he argues, 
have been shaped by an ideology that, 
although well-intentioned, has deformed 
our commercial and social relationships. 
By mistaking free information for freedom, 
the network's designers and defenders have 
inadvertently created a system that central- 
izes power and profit. Companies such as 
Google and Facebook take in billions of 
dollars by hosting online exchanges, but 
the people who actually create whatever is 
being exchanged — words, ideas, works of 
art — often get nothing. The joy of participa- 
tion, they’re told, should be compensation 
enough. 

As digital networks come to regulate 
more of the economy, Lanier sees a perverse 
dynamic taking hold. Wealth concentrates 
around those who control the servers and 
databases, whereas risk spreads outwards to 
the masses. He points to the banking crisis 
of 2008 as an example. By erasing local mar- 
ket boundaries and controls, computerized 
financial systems helped to funnel riches to 
a handful of bankers and traders — yet when 
the system collapsed, it was ordinary citizens 
who paid the bill. 

The only way to change the dynamic is 
to redesign our computer networks to be 
a little less efficient and a little more egali- 
tarian. Lanier imagines a “symmetrical” 
web, in which every piece of information 
is linked back to the person who created it. 
Copying the information triggers a “micro- 
payment” to its creator. By placing a price 
on information, you constrain a company’s 
ability to track and manipulate people and 
to reap windfalls by exploiting massive data 
stores. 

Many of Lanier’s proposals, including his 
call to assign everyone “a universal online 
identity”, will be controversial. And some of
them, such as the micropayments scheme 
— which would require the value of every 
Facebook update and blog comment to be 
calculated — seem far-fetched. But, like 
Morozov, Lanier does a service by chal- 
lenging us to address societal problems as 
humanists, not engineers.


Nicholas Carr writes on technology and 
culture. His book The Shallows: What 
the Internet Is Doing to Our Brains was 
a finalist for the 2011 Pulitzer Prize in 
General Nonfiction. 

e-mail: ncarr@mac.com


Anna Maria Luisa de Medici bequeathed her family’s legacy to the city of Florence in Italy.


Medicean secrets 


Alison Abbott enjoys a scientific history of the family 
who ruled Florence during the Renaissance. 


The Medici clan held sway over
Florence and Tuscany during the
Italian Renaissance and well beyond.
They created one of Europe's most powerful 
banks, ruled Florence and produced four 
Popes. Today they are best remembered for 
their patronage of science, art and architec- 
ture. We have them to thank for Florence’s 
enduring beauty — including the mag- 
nificent Basilica of San Lorenzo. Designed 
by the architect Filippo Brunelleschi, this 
is where most of the family were buried, 
several after violent deaths. 

Tombs were shuffled around within San 
Lorenzo four times between the fifteenth 
and nineteenth centuries, as successive gen- 
erations tried to improve the presentation 
of their dynasty. Some remains got jum- 
bled; some labelling got lost. Then, in 1945, 
anthropologist Giuseppe Genna disinterred 
23 skeletons to make measurements that he 
hoped would support fashionable ‘anthro- 
pometric’ theories of psychological traits. 
The research was scientifically misguided 
and damaging: Genna removed all traces of 
flesh from the bones before returning them 
to their graves. Today's molecular biologists 
could have made much of those scraps. 

Now the exhibition The Medici at the Reiss- 
Engelhorn Museum in Mannheim, Germany, 
charts the family’s rise and fall, from founding
father Giovanni di Bicci (1360-1429) to
Anna Maria Luisa de Medici, the last of the
clan, who died in 1743.
It is Anna Maria Luisa
whose crowned skull
graced international
news media last month
(see http://go.nature.com/gws6g3).

The Medici: People, Power and Passion

The exhibition has the familiar round-up of 
formal portraits, many on loan from the Uffizi 
in Florence, the world-famous art museum 
built for the Medici. The family was known 
from contemporary documents to have been 
plagued by disfiguring illnesses such as psori- 
asis, syphilis and arthritis, which, unsurpris- 
ingly, the flattering portraits do not reflect. 

But the paintings are complemented by 
casts of the skulls of each of those pictured. 
Some were made by Genna, others after 
more systematic exhumations — of both 
bones and the pots in which the Medici 
interred the entrails of the deceased — car- 
ried out since 2004, when the Medici Project 
was launched. This project was a collabora- 
tion of Italian scientists and the culture 
ministry to assess any damage caused by the 
catastrophic flooding of the city in 1966, to 
identify which remains belonged to whom 
and to try to work out, using molecular biol- 
ogy, what the various individuals died from. 

The exhibition weaves stories about each 
Medici from these scientific results and from
analyses of contemporaneous documents. A
couple involve murder. Beautiful, flirtatious
Isabella (1542-76) was strangled by her jealous
husband, with the apparent encouragement
of her brother. Her bones were thrown
into a mixed grave. Researchers managed to
identify which skull was likely to be hers,
and used forensic techniques to make a
facial reconstruction for the exhibition that
closely matches a contemporaneous portrait.
It stands startlingly apart from the other oil
portraits in its modernity and humanity.

REISS-ENGELHORN MUSEUM, MANNHEIM, GERMANY.
Until 28 July 2013

Project scientists also managed to tenta- 
tively identify the entrails pot of Bianca Cap- 
pello, long-term mistress and then second 
wife of Francesco I (1541-87), whose bones 
have disappeared. Bianca and Francesco died 
within hours of each other. Scientists found 
traces of arsenic in their remains, lending 
substance to the historical claim that they 
were poisoned rather than dying of malaria 
as autopsy doctors declared at the time. 

But it is the exhibition's centrepiece — the
exhumation of Anna Maria Luisa — that 
propelled it into the headlines recently. 
When project scientists at the University of 
Florence opened the wooden coffin of Anna 
Maria Luisa last October, they were startled. 
Her skeleton was almost undamaged by the 
floods and topped by a crown — not the 
expected Medici death crown, but that of 
her husband’s principality of Palatinate, in 
what is now Germany’s Rhineland, where 
the museum is located. 

The scientists used a three-dimensional 
scanner to replicate the skull for palaeo- 
forensics, one of the first such applications 
of the technology. They removed a small 
fragment of bone to analyse carbon and 
nitrogen isotopes that might illuminate how 
rich the Medicis’ diets were in meat and fish,
and for DNA analysis to determine her cause 
of death. Documents from the time suggest 
it could have been syphilis or breast cancer. 
Researchers also took samples for DNA test- 
ing from an unlabelled pot of entrails that 
they suspect belongs to her. The exhibition 
will be updated should results arrive. 

A final thrill for scientists visiting the 
exhibition is a relic of Galileo Galilei
(1564–1642), on display for the first time outside
Italy. Galileo depended on Medici patronage. 
To keep things sweet, he named the moons 
of Jupiter, which he discovered in 1610, the 
Medicean Planets. The relic — the astrono- 
mer’s fifth lumbar vertebra — has spent dec- 
ades in a safe box at the University of Padua. 
It was brought out in 2010 when NASA 
requested a fragment for its Juno mission to 
Jupiter. The Italian Space Agency, apparently 
fearing Vatican disapproval, declined. After 
the Medici exhibition, the relic will go on 
permanent display at the university.


Alison Abbott is Nature’s senior European
correspondent.


Corrected numbers
for fish on Red List 


Kelly Swing gives inaccurate 
numbers for marine fish species 
on the International Union for 
Conservation of Nature (IUCN) 
Red List of Threatened Species. 
He also mistakenly conflates 
the scientific process of species 
assessment for the Red List with 
the separate political process of 
IUCN member voting (Nature 
494, 314; 2013). 
About one-quarter, or 4,337, 
of some 17,000 species of marine 
fish are on the IUCN Red List 
(not fewer than 100 species out of 
25,000, as Swing writes). Of those, 
416 species have been placed in 
a threatened category (Critically 
Endangered, Endangered or 
Vulnerable) and 1,180 species 
were classed as Data Deficient. 
The IUCN Tuna and Billfish 
Specialist Group has assessed 
all 10 species of billfish and 
51 species of tuna and mackerel 
through a series of regional 
workshops, unhindered by 
the “IUCN’s worldwide voting
procedures”. Seven species 
meet the IUCN threshold for a 
threatened category 
(B. B. Collette et al. Science 333, 
291-292; 2011). The sale of 
billfish has since been banned in 
the continental United States. 
The IUCN Marine Biodiversity 
Unit’s Global Marine Species 
Assessment programme is now 
evaluating all remaining marine 
fish, and aims to finish within 
5 years (http://sci.odu.edu/gmsa). 
Bruce B. Collette National 
Marine Fisheries Service, 
Washington DC, USA. 
collettb@si.edu 
Beth Polidoro Arizona State 
University, Phoenix, USA. 
Kent Carpenter Old Dominion 
University, Norfolk, Virginia, USA. 


Regulating stem-cell 
therapies worldwide 


Japan's drive to regulate 
experimental stem-cell 
treatments is a welcome step 
(Nature 494, 5; 2013). However,


it could be hard to define 
universally applicable criteria that 
ensure the safety and effectiveness 
of such treatments worldwide. 

There is a risk that the social 
and economic circumstances of 
patients and researchers might 
blur or compromise crucial 
criteria, such as evidence-based 
support (including adequate 
preclinical and clinical testing), 
ethical review and no-fee 
experimental treatment. 

Stem-cell therapies are all too 
often unsupported by scientific 
evidence, but many patients 
are prepared to accept the risks. 
And nations with inadequate 
health-care resources may expect 
patients to pay for experimental 
treatments. 

Proliferation of experimental 
stem-cell therapies may even 
be indirectly encouraged in 
countries such as China, where 
ultra-light regulation through 
official notification (see
www.moh.gov.cn) has uncertain
regulatory value. 
Margaret Sleeboom-Faulkner 
University of Sussex, Brighton, UK. 
m.sleeboom-faulkner@sussex.ac.uk 


Order health systems 
in developing world 


Governments and policy-makers 
are aiming to improve health 
markets in developing countries 
as they take up the challenge 

of last year’s United Nations 
resolution to move towards 
universal health coverage (see 
go.nature.com/acsmss). We 
caution that they must do more 
than simply legislate their way to a
more orderly health system. 

We have highlighted the risks 
associated with unregulated 
health-care services and products 
in developing countries (Nature 
487, 163-165; 2012). We now 
wish to draw governments’ 
attention to improvements 
proposed at a conference 
on securing international 
agreements on future health 
markets. 

A small group of health policy- 
makers, entrepreneurs, academics 


and funders met in Bellagio, Italy, 
at the end of last year. Among 
their recommendations were 
that countries should establish 
systems for collecting better 
basic health-market data for 
incorporation into local policy 
and management processes, and 
that they should organize funds 
to promote sound regulatory 
practice (see go.nature.com/npsdvg).

The group called for a major 
effort by all market players to test 
innovative regulatory approaches 
and business models to improve 
access to safe and effective health 
services in the developing world. 
David H. Peters Johns Hopkins 
Bloomberg School of Public 
Health, Baltimore, Maryland, 
USA. 
dpeters@jhsph.edu 
Gerald Bloom University of 
Sussex, Brighton, UK. 


Modelling genetics 
within ecosystems 


Safeguarding genetic diversity 
is one of the Convention on 
Biological Diversity’s main 
targets, because genetics 
underpins ecosystems. General 
models of whole ecosystems 
(D. Purves et al. Nature 493, 
295-297; 2013) therefore need to 
incorporate genetic data if they 
are to represent natural systems 
and guide conservation policy. 
Yet it is important to 
understand that species 
diversity and genetic diversity 
do not always correlate; that the 
interplay between phylodiversity 
and functional diversity can be 
highly complex and regulated 
by interactions between 
cornerstone species; and that 
ecosystem resilience is tied to 
evolutionary history and genetic 
diversity. An ecosystem’s full 
genetic potential, as represented 
by ancient lineages and 
maximally diverse taxa and key 
species, must be realized. 
Niall McCann, Pablo Orozco 
ter Wengel, David Stanton 
Cardiff University, Cardiff, UK. 
mccannnp@cardiff.ac.uk


Drew Purves et al. reply: It could 
be useful to incorporate genetics 
into general ecosystem models 
(GEMs), along with complexities 
such as stoichiometry or long- 
range migrations. Genetically 
based GEMs could interface 
with the increasing amounts of 
genetic data available to capture 
the effects of individual-scale 
adaptation, although this can be 
approximated without explicit 
genetics (for example, traits 
mutate in the Madingley model 
we describe in our Comment). 
However, such complexities 
bring greater computational 
demands and increased model 
freedom, which might lead to 
the model producing almost 
any output unless properly 
constrained with data. 


Evaluate gender 
equality in journals 


The European Association of 
Science Editors established
a gender policy committee
last year to develop a set of
standards for adoption by 
scientific journals. As co-chairs 
of the committee, our first step 
is to invite science editors to 
contribute to a survey of gender- 
equality policies in their journals 
(see go.nature.com/wor7ks; 
survey closes on 10 April). 

In this survey, we ask editors 
for their views on considering 
sex and gender in experimental 
design and data analysis, and on 
presenting data that are broken 
down by sex. Information is also 
requested on gender balance and 
its promotion among editorial 
staff, editorial boards and peer 
reviewers. 

Our hope is that all journals 
will eventually follow Nature's 
example in promoting gender 
equality in science (see
www.nature.com/women).

Shirin Heidari Journal of the 
International AIDS Society, 
Geneva, Switzerland. 
shirin.heidari@iasociety.org 
Tom Babor University of 
Connecticut School of Medicine, 
Farmington, Connecticut, USA.


Rewriting a genome


A bacterial enzyme that uses guide RNA molecules to target DNA for cleavage has been adopted as a programmable tool 
to site-specifically modify genomes of cells and organisms, from bacteria and human cells to whole zebrafish. 


EMMANUELLE CHARPENTIER 
& JENNIFER A. DOUDNA 


versity in Japan reported an apparently
minor finding. While investigating the
sequence of a bacterial gene that encodes the 
enzyme alkaline phosphatase, they discovered 
an unusual segment of neighbouring DNA 
that consisted of short, directly repeating 
nucleotide sequences flanked by short unique 
segments’. They noted that “the biological 
significance of these sequences is not known’. 
Fast-forward almost three decades, and what 
initially seemed to be an obscure observation 
is now being used to open the door to easy 
manipulation of the genomes of a multitude 
of organisms. Five papers published within a 
month of each other, in Science”’ and Nature 
Biotechnology*®, report the application of 
such bacterial sequences — now referred to 
as CRISPR-Cas systems’ — as a simple and 
versatile tool for genomic editing. 

As whole-genome sequencing became 
routine in recent decades, regions containing 
CRISPR (clustered regularly interspaced short 
palindromic repeat) sequences and CRISPR- 
associated (Cas) genes were found in a wide 
variety of bacteria and archaea* ’. The discov- 
ery’”"' that the short unique sequences in these 
arrays matched DNA sequences from viruses or 
plasmids (small non-chromosomal DNA mol- 
ecules that can be transferred among bacteria 
and archaea) hinted that CRISPR-Cas systems 
encode ‘adaptive’ immune systems, provid- 
ing specific defences against invaders. Subse- 
quent genetic and biochemical experiments 
confirmed this speculation by showing that 
CRISPR-Cas systems allow detection of and 
protection against mobile genetic elements”. 

Although some CRISPR-Cas systems 
require multiple proteins to function”, the 
type II systems found in many bacteria'*"*"® 
use a single endonuclease, Cas9 (Fig. 1). 
This enzyme acts together with guide 
RNA to locate and cleave invading DNA at 
sites demarcated by conserved sequences 
called proto-spacer adjacent motifs 
(PAMs)”’”'®. To form a functional DNA- 
targeting complex, Cas9 requires two distinct 
RNA transcripts, CRISPR RNA (crRNA) 
and trans-acting CRISPR RNA (tracrRNA)””’. 


|: a 1987 paper, researchers at Osaka Uni- 
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Figure 1 | Targeted genome editing with RNA-guided Cas9. The enzyme Cas9 is a DNA endonuclease 
found in many bacteria, in which it functions as part of a defence system against invading DNA 
molecules, such as viruses. Cas9 has two active sites that each cleave one strand of a double-stranded 
DNA molecule. The enzyme is guided to the target DNA by an RNA molecule that contains a sequence 
that matches the sequence to be cleaved, which is demarcated by PAM sequences. RNA-guided 

Cas9 activity creates site-specific double-stranded DNA breaks, which are then repaired by either 
non-homologous end joining or homologous recombination. During homologous recombination, the 
addition of donor DNA enables new sequence information to be inserted at the break site. Several recent 
papers show that RNA-guided Cas9 systems can be used to engineer the genomes of human and mouse 
cell lines” *””, bacteria* and — by modifying one-cell-stage embryos — zebrafish’. 


However, it was recently shown’ that this dual 
RNA can be reconfigured as a single-guide 
RNA (sgRNA) that includes sequences that 
are sufficient to program Cas9 to introduce 
double-stranded breaks in target DNA. As the 
new publications show, RNA-guided Cas9 can 
function in a variety of cells and organisms to 
cleave intact genomes at specific sites. And this 
is the point at which the potential for genome 
editing comes in. When the double-stranded 
breaks are repaired by standard cellular repair 
mechanisms, either by homologous recom- 
bination (the exchange of genetic informa- 
tion between DNA molecules with similar 
sequences) or non-homologous end joining 
(NHB); the introduction of insertions or dele- 
tions into the sequence), the sequence at the 
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repair site can be modified or new genetic 
information inserted. 

Three of the studies demonstrate that RNA-programmed Cas9 can function in human 
cells. Cong et al., Mali et al. and Cho et al. engineered versions of the Cas9 enzyme 
from the bacterium Streptococcus pyogenes so that it would be active in the nucleus of 
human cells, and designed dual RNAs or sgRNAs that included a 20-nucleotide sequence 
complementary to human target DNA sequences. When the researchers expressed the 
‘humanized’ Cas9 together with these guide RNAs in various human cell lines, including 
induced pluripotent stem cells, they observed the expected alterations to the target 
DNA, achieved through the introduction of double-stranded breaks followed by repair. 
Gene targeting succeeded at frequencies of up to 38% and was accompanied by only a low 
level of Cas9 toxicity. The RNA-guided Cas9 was also efficient at triggering targeted 
gene replacement at normal genomic sites in human cells. In another paper published in 
the same month, Jinek et al. show that RNA-programmed Cas9 functions in human cells to 
trigger site-specific genome modifications, and that the ability of Cas9 to assemble 
with guide RNA in cells is a limiting factor in this activity. 
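
The targeting rule described above (a 20-nucleotide guide sequence next to a PAM) can be sketched in a few lines of Python. This is an illustration only, not the design procedure used in any of the papers. It assumes the 5'-NGG-3' PAM generally associated with S. pyogenes Cas9, which is not stated in the text, and the function names are hypothetical. 

```python
import re

# Assumption: S. pyogenes Cas9 recognises a 5'-NGG-3' PAM directly 3' of the
# 20-nt protospacer. The lookahead lets overlapping candidate sites be found.
SITE_PATTERN = re.compile(r"(?=([ACGT]{20})([ACGT]GG))")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_target_sites(dna):
    """List (strand, start, protospacer, pam) for every 20-nt site followed by NGG.

    For the '-' strand, start is counted along the reverse-complemented sequence.
    """
    dna = dna.upper()
    sites = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for match in SITE_PATTERN.finditer(seq):
            sites.append((strand, match.start(), match.group(1), match.group(2)))
    return sites

def guide_rna(protospacer):
    """The 20-nt guide has the protospacer sequence, written as RNA (T -> U)."""
    return protospacer.replace("T", "U")

example = "C" + "A" * 20 + "TGG"
for strand, start, protospacer, pam in find_target_sites(example):
    print(strand, start, guide_rna(protospacer), pam)
```

A real design tool would also score each candidate for similar sequences elsewhere in the genome, which is the off-target problem raised later in this article. 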

On the basis of earlier observations that single-stranded DNA breaks can favour 
homologous recombination and reduce off-target mutagenesis, Mali et al. and Cong 
et al. also tested versions of Cas9 that have been shown to act as a nickase enzyme, 
one that breaks only one strand of a DNA molecule. The mutated enzymes had lower rates 
of NHEJ but were as efficient as the wild-type endonuclease at gene replacement 
triggered by homologous recombination. Both groups also demonstrated further 
functionality of the system in ‘multiplexed’ targeting; the expression of 
sgRNA-programmed Cas9 that can bind to two different genomic sequences led to sequence 
disruption at more than one independent target site. In addition, Cong and colleagues 
show that gene-disruption efficiency could be improved upon independent expression of 
the two RNA components of the original dual tracrRNA-crRNA combination. This finding 
implies that improved design of sgRNAs may allow them to better mimic the dual RNA 
structure. 

In addition to these results in cell lines, RNA-guided Cas9 can be used to engineer 
genomic changes in intact organisms. Jiang and colleagues show that the system can be 
used in bacteria to modify multiple sites by programming Cas9 with several different 
guide RNAs in a single cell. This technology could be exploited to engineer 
microorganisms that are otherwise genetically intractable to harbour pathways for 
producing biofuels and molecules of therapeutic value. Working with zebrafish, Hwang 
and colleagues show that injection of one-cell-stage embryos with Cas9-encoding mRNA 
and appropriate guide RNAs produced high frequencies (24-59%) of targeted insertions 
and deletions at eight of ten sites in all embryos tested. These findings hint that 
RNA-guided Cas9 might be useful for engineering other multicellular organisms, 
including mammals and plants. One of the most exciting potential uses of such 
technology would be to provide a straightforward means of generating animal models of 
human disease. 

Genome engineering by RNA-programmable Cas9 promises to have broad applications in 
synthetic biology, direct and multiplexed perturbation of gene networks, and targeted 
ex vivo and in vivo gene therapy. The next challenges will be to analyse and address 
possible off-target effects and improve the efficiency and specificity of the system, 
while expanding its use to other organisms. In this regard, it will be important to 
compare RNA-programmed Cas9 with existing genome-editing tools, including 
meganucleases, ZFNs (zinc-finger nucleases) and TALENs (transcription activator-like 
effector nucleases). In addition to genome editing, this approach offers the exciting 
possibilities of transcriptional gene silencing using an inactive Cas9 (ref. 20) or 
engineering Cas9 to have new functions, such as transcriptional activation. The 
discovery and application of bacterial systems, such as restriction enzymes and 
thermostable polymerases, have revolutionized molecular biology in the past. With 
RNA-guided Cas9 enzymes, bacteria now offer a versatile tool for rewriting genomic 
sequence information that has the potential to reshape the genome-engineering 
landscape in biotechnology and medicine. 
An accurate distance 
to the nearest galaxy 


By having a highly accurate value for the distance to the Large Magellanic Cloud 
galaxy, astronomers can get a better measure of cosmic ‘dark energy’. Using 
binary stars, they have now achieved a value accurate to 2.2%. SEE LETTER P.76 


BRADLEY E. SCHAEFER 


Distances to celestial bodies are crucial 
in astronomy. They allow astronomers 
to understand the structure of 
the Universe; for example, to see the organiza- 
tion of the Solar System and to recognize that 
galaxies lie beyond the Milky Way. The derived 
physical sizes of bodies scale with the distances 
adopted for them, whereas their energetics 
scale with the square of the distances. A cur- 
rent hot enterprise is to use distance meas- 
urements to the farthest supernovae to map 
out the expansion history of the Universe 
and to uncover the nature of the Universe’s 
mysterious dark energy. Distances are deduced 
by means of a ‘distance ladder’: knowledge of 
the distances to nearby bodies is used to determine 
the distances of bodies farther out, and 
so on to yet more remote objects. On page 76 
of this issue, Pietrzyński et al. claim to provide 
a much-needed, highly accurate measure 
of the distance to the Large Magellanic Cloud 
galaxy — the bottleneck in the ascent of the 
distance ladder. 

Figure 1 | The Large Magellanic Cloud. Pietrzyński et al. have determined an accurate distance to the 
Large Magellanic Cloud galaxy, the ‘anchor’ point from which distances to other galaxies can be measured. 


Historically, the lowest ‘rung’ of the distance 
ladder, the size of Earth, was used to calibrate 
the timings of the transit of Venus across the 
Sun, and so to climb to the second rung, the 
Earth-Sun distance. The method of parallax 
(watching stars wobble back and forth 
as Earth orbits the Sun) was used to climb 
to the third rung, the distance to nearby stars. 
For cosmology and extragalactic astronomy, 
almost all ways of climbing the distance 
ladder pass through the rung of the Large 
Magellanic Cloud (LMC; Fig. 1), which is the 
evocatively named nearest galaxy to the Milky 
Way. So if the LMC distance is off by 10%, 
then the distances to all galaxies (as quanti- 
fied with the Hubble constant, the expan- 
sion rate of the Universe) will be in error by 
the same 10%. When combined with meas- 
ures of far-off supernovae, a highly accurate 
distance to the LMC will also substantially 
improve measurements of the properties of 
dark energy, and thereby potentially reveal 
its nature. 

The distance to the LMC has long been a contentious question. Published values over 
the decades have been scattered, with values from the decade before 2001 alone having 
a scatter of 36%, despite the fact that the reported error bars are much smaller than 
this scatter. So it is clear that the many methods used to determine the LMC distance 
had large and unidentified systematic errors. In 2001, the influential Hubble Space 
Telescope Key Project presented a middle value of the previous scatter with a 
reasonable error bar (50.1 ± 2.3 kiloparsecs, or 163,400 ± 7,500 light years). But its 
publication created a bandwagon effect, and for the next six years the astronomy 
community published distances that slavishly followed this value. The 2011 Nobel 
laureate Adam Riess and the SH0ES project have since adopted a method pioneered by a 
group centred at Villanova University in Pennsylvania (involving four eclipsing binary 
stars) to obtain an LMC distance of 49.8 kiloparsecs, with an accuracy to 3% (ref. 9). 

Now Pietrzyński et al., as part of the international 
Araucaria Project, derive a value that is 
accurate to 2.2%. They achieved this by meas- 
uring the distances to eight eclipsing binaries 
in the LMC. (Eclipsing binaries are two-star 
systems whose component stars regularly pass 
directly in front of each other as viewed from 
Earth.) They measured accurate sizes for both 
stars in each binary (from the durations of 
the eclipses and the measured velocities) and 
their surface temperatures (from spectra of the 
stars), thereby deriving the total intrinsic lumi- 
nosity of each binary system. By comparing the 
total luminosity and the observed brightness, 
and using the inverse square law of light, they 
then determined the distances to the binaries. 
This method inspires confidence because it 
uses only undergraduate astronomy and phys- 
ics (but very carefully done), and completely 
jumps over all the lower rungs of the distance 
ladder. The authors’ new distance to the LMC, 
which is based on the average of the distances 
to the eight binaries, is 49.97 ± 1.13 kiloparsecs. 
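
The chain of reasoning in this paragraph (sizes and temperatures give luminosity, and luminosity plus observed brightness gives distance) can be written out as a short Python sketch. It is a simplified illustration and not the Araucaria Project's pipeline: it treats each star as a perfect black body, ignores dust extinction, and checks itself against nominal values for the Sun rather than LMC data. The function names are hypothetical. 

```python
import math

SIGMA_SB = 5.670374419e-8  # Stefan-Boltzmann constant, W m^-2 K^-4

def stellar_luminosity(radius_m, temperature_k):
    """Luminosity of a spherical black body: L = 4*pi*R^2 * sigma * T^4."""
    return 4.0 * math.pi * radius_m**2 * SIGMA_SB * temperature_k**4

def distance_from_flux(luminosity_w, flux_w_m2):
    """Invert the inverse-square law F = L / (4*pi*d^2) to obtain d."""
    return math.sqrt(luminosity_w / (4.0 * math.pi * flux_w_m2))

def binary_distance(stars, total_flux_w_m2):
    """stars holds one (radius_m, temperature_k) pair per component star."""
    total_luminosity = sum(stellar_luminosity(r, t) for r, t in stars)
    return distance_from_flux(total_luminosity, total_flux_w_m2)

# Sanity check on a 'binary' with one component: the Sun seen from Earth.
SUN = (6.957e8, 5772.0)    # nominal solar radius (m) and effective temperature (K)
SOLAR_FLUX_AT_EARTH = 1361.0  # W m^-2
AU = 1.495978707e11        # m

d_sun = binary_distance([SUN], SOLAR_FLUX_AT_EARTH)
print(f"recovered Earth-Sun distance: {d_sun / AU:.3f} au")
```

Because the distance scales as the square root of luminosity over flux, a fractional error in either quantity becomes roughly half that fractional error in distance. 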
A comparison of the new LMC distance with previously published values reveals three 
disturbing issues. First, the Araucaria Project had previously reported one of its 
eclipsing binaries to have a distance of 50.1 ± 1.4 kiloparsecs, but it now quotes 
49.3 ± 0.5 kiloparsecs for that binary, with no indication of how its analysis has 
changed. Second, and much more disturbingly, the four eclipsing binaries reported by 
the Villanova group have an average distance of 47.1 ± 1.4 kiloparsecs, which is 
substantially different. The main difference between the groups’ techniques is the use 
of cool stars by the Araucaria Project, together with an empirical 
surface-brightness-versus-colour relation to determine the stars’ surface temperature, 
as opposed to the use of hot stars by the Villanova group and theoretical models to 
infer the temperature. Third, the new distance is remarkably close to the value 
obtained by the Key Project, with a difference of only one-tenth of the error bar 
quoted by Pietrzyński et al., so we should recall the history of the bandwagon effect. 
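
The figures quoted in this comparison can be checked with simple arithmetic. The sketch below uses only the distances given in the text. The one added assumption is that the quoted error bars are independent and Gaussian, so that they combine in quadrature, which the text itself does not claim. 

```python
import math

LY_PER_KPC = 3261.56  # light years per kiloparsec (rounded)

def fractional_error(value, sigma):
    """Error bar expressed as a fraction of the measured value."""
    return sigma / value

def tension(a, sigma_a, b, sigma_b):
    """Difference between two measurements in units of their combined error.

    Assumes independent Gaussian errors, combined in quadrature.
    """
    return abs(a - b) / math.hypot(sigma_a, sigma_b)

NEW = (49.97, 1.13)         # Pietrzynski et al., kiloparsecs
KEY_PROJECT = (50.1, 2.3)   # Hubble Space Telescope Key Project, kiloparsecs
VILLANOVA = (47.1, 1.4)     # average of four eclipsing binaries, kiloparsecs

print(f"new value, fractional error: {100 * fractional_error(*NEW):.1f}%")
print(f"Key Project in light years: {KEY_PROJECT[0] * LY_PER_KPC:,.0f}")
offset = abs(NEW[0] - KEY_PROJECT[0]) / NEW[1]
print(f"offset from Key Project: {offset:.2f} of the new error bar")
print(f"tension with Villanova: {tension(*NEW, *VILLANOVA):.1f} sigma")
```

Run as written, the offset from the Key Project value comes out close to the "one-tenth of the error bar" quoted above, whereas the Villanova average sits more than one combined error bar away. 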

What can we expect in the future? Studies 
of roughly a dozen more eclipsing binaries are 
in the pipeline, with results expected soon. 
Also, extensions of the technique will be made 
to the eclipsing binaries in the galaxy M33, in 
the constellation Triangulum. But the entire 
scene will change when the Gaia spacecraft 
is launched later this year, with its awesome 
capability for measuring accurate distances 
by means of the parallax, even out to the 
LMC. Gaia will provide definitive calibration 
of various ‘standard candles’ used to climb 
rungs above the LMC on the distance ladder, 
thereby eliminating the bottleneck at the LMC. 
Indeed, Gaia will obtain parallax data for many 
stars in the LMC with the resultant average 
accurate to 1%, and will map out its struc- 
ture. So the time spent by the LMC’s eclipsing 
binaries at the forefront of astrophysics will be 
limited to only the next few years. 
STEM CELLS 


A unifying theory 
for the crypt 


A long-standing ambiguity has been whether quiescent cells located in intestinal 
crypt structures are stem cells. The answer seems to be yes and no, depending on 
how one defines the term stem cell. SEE ARTICLE P.65 


HANS CLEVERS 


The epithelial cell layer that covers the 
small intestine is the self-renewal 
champion of the mammalian body: 
the life cycle of its cells is only 4-5 days. As 
a result, this epithelium, combined with its 
characteristic architecture of crypt-villus 
units, has become the model-of-choice for 
studying adult mammalian stem cells. The 
relationship between, and the functions of, two 
stem-cell populations (one rapidly cycling, the 
other quiescent) at intestinal crypts has been a 
matter of great interest and debate. On page 65 
of this issue, Buczacki et al. use an ingenious 
strategy to genetically mark quiescent crypt 
cells and thus shed light on these issues*. 

The operational definition of adult stem 
cells has only two criteria. Such cells must be 
capable of self-renewal and persistence over 
the lifetime of an animal, and be multipotent 
— have the capacity to generate all cell types of 
a tissue. In addition, all types of stem cell have 
been tacitly assumed to be quiescent ever since 
haematopoietic stem cells of the blood system 
were noted to divide infrequently. The ability 
of stem cells to retain molecules that have been 
introduced experimentally, to label their DNA 
or chromatin (DNA-protein complexes), has 
made detection of the non-proliferative state 
easy, and has made label retention a popular 
surrogate stem-cell marker. 

After their formation from stem cells at 
crypt bottoms, young epithelial cells prolifer- 
ate intensely for 2 days before differentiating 
and exiting the crypts. The cells then move 
onto the villus flanks to process nutrients and 
finally die at the villus tips on days 4-5. Paneth 
cells also derive from stem cells, but persist for 
1-2 months at the crypt base, where they 
are thought to maintain stem-cell renewal and 
provide protection against bacteria. 

Two alternative identities for crypt stem cells were defined decades ago. On the one 
hand, it was proposed that small proliferative cells that are wedged between the 
Paneth cells represent the crypt stem cells. Much later, the protein Lgr5 was 
discovered to be an exquisite marker for these cells, allowing formal, experimental 
proof of their ‘stemness’ in vivo. Subsequently, single Lgr5-expressing (Lgr5⁺) cells 
were shown to form ever-expanding, transplantable ‘mini-guts’ in vitro. On the other 
hand, the presence of DNA-label-retaining cells (LRCs) was noted directly above the 
Paneth cells, at position +4 (counting Paneth-cell nuclei from the bottom of the 
crypt). Several molecular markers were reported for these quiescent ‘+4 LRCs’, 
including the protein Bmi1 (ref. 7). Confusingly, however, Lgr5⁺ cells also robustly 
express these +4 markers. 


*This article and the paper under discussion were 
published online on 27 February 2013. 

Buczacki and colleagues reassess the nature 
of crypt LRCs. The authors labelled all crypt 
cells (except the long-lived Paneth cells) by 
briefly expressing in them the stable chro- 
matin marker histone 2B fused to the yellow 
fluorescent protein (YFP). What happened 
over time was intriguing. After this pulse, 
all of the dividing cells rapidly diluted out 
the label. A week later, cells still carrying the 
label — the LRCs — were seen only near the 
crypt base, and included Paneth cells. In the 
initial weeks after labelling, non-Paneth-cell 
LRCs were also evident and surprisingly co- 
expressed Lgr5, Paneth-cell markers and +4 
markers. Buczacki et al. concluded that this 
second LRC class represents non-dividing 
precursors of Paneth cells (Fig. 1). 

To test the properties of these cells, the 
authors devised a strategy that directly 
exploits the quiescent state to genetically 
mark the non-dividing LRCs. Specifically, 
they briefly expressed histone 2B fused to the 
amino-terminal segment of the enzyme Cre 
recombinase, CreA, and found that — like 
histone-2B-YFP — this fusion protein was 
retained only in the chromatin of non-divid- 
ing Paneth cells and their presumed precur- 
sors. The other part of the enzyme, CreB, was 
expressed constitutively in all cell types. 

Reassociation of CreA and CreB into 
functional Cre recombinase depends on the 
presence of an injectable small chemical. Two 
weeks after the pulse, Buczacki et al. injected 
this chemical into mice to genetically mark the 
LRCs with the now-intact Cre recombinase. In 
healthy mice, the marked cells failed to divide. 
When crypts were damaged, however, these 
LRCs generated large, persistent crypt-villus 
ribbons, the telltale signature of crypt 
stem cells. 
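
The logic of the marking strategy can be captured in a toy Python model: a chromatin-bound label halves at each division, and a cell is marked only if it still holds the CreA fusion when the chemical is injected. The halving rule, the retention threshold and the division counts below are illustrative assumptions, not values from the study. 

```python
from dataclasses import dataclass

@dataclass
class Cell:
    name: str
    divisions_since_pulse: int

def label_fraction(divisions):
    """Fraction of the pulse label left, assuming it halves at each division."""
    return 0.5 ** divisions

def is_label_retaining(cell, threshold=0.1):
    """Hypothetical cut-off: a cell counts as an LRC above this label fraction."""
    return label_fraction(cell.divisions_since_pulse) >= threshold

def is_marked(cell, chemical_injected, threshold=0.1):
    """Cre is reconstituted only where CreA, CreB and the chemical coincide."""
    has_cre_a = is_label_retaining(cell, threshold)  # histone-bound, diluted by division
    has_cre_b = True                                 # expressed in all cell types
    return has_cre_a and has_cre_b and chemical_injected

cells = [
    Cell("quiescent Paneth-cell precursor", divisions_since_pulse=0),
    Cell("cycling crypt cell", divisions_since_pulse=14),
]
for cell in cells:
    print(cell.name, "marked:", is_marked(cell, chemical_injected=True))
```

The point of the design is that marking reports a cell's division history rather than the expression of any particular marker gene. 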

The authors’ genetic-labelling approach 
is original, allowing documentation of LRC 
behaviour over time. Because this technol- 
ogy for the first time directly exploits the 
label-retention trait to mark stem cells, it will 
be broadly applicable to the study of other 
quiescent stem cells. Indeed, it will now be 
possible to investigate the dynamics of stem- 
cell compartments in other tissues in which 
quiescent stem cells have been postulated to act 
at the base of the self-renewal hierarchy, includ- 
ing bone marrow, hair follicle, muscle and brain. 

Buczacki et al. also show that isolated LRCs form mini-guts in culture, confirming the 
conclusions from their Cre-recombinase data that these cells can regain a 
stem-cell-like behaviour. Similar observations have been made for the proliferative, 
more primitive, Dll1-expressing cells, which are precursors to all intestinal 
secretory cells. 


Figure 1 | Complexities of intestinal stem cells. a, The intestinal epithelium follows the distinct 
contours of villus-crypt units in the intestine. b, Normally, Lgr5-expressing stem cells (Lgr5⁺) lead to 
the production of precursor cells that further differentiate into the various types of crypt epithelial cell. 
Buczacki et al. report that precursors of one type of differentiated intestinal cell, Paneth cells, can persist 
for several weeks in a quiescent state before maturing into Paneth cells. Intriguingly, these quiescent 
precursors can revert back into Lgr5⁺ stem cells following crypt damage. 

The current study unifies earlier theories about the identity of crypt stem cells. The 
cycling Lgr5⁺ stem cells are the engines of crypt self-renewal. They generate a 
transient population of non-dividing daughter cells that are destined to differentiate 
into relatively long-lived Paneth cells. These daughter cells express Lgr5, as well as 
markers for the proposed quiescent +4 stem cells; they probably, therefore, represent 
the +4 cells. As individual cells, LRCs disappear over a few weeks and so cannot be 
considered stem cells. However, LRCs are constantly being generated by the cycling 
Lgr5⁺ stem cells and so, as has also been proposed elsewhere, LRCs are always 
available as ‘reserve stem cells’ that can be called into action if tissue is damaged 
(Fig. 1). This surprising design of the crypt stem-cell hierarchy reconciles many of 
the paradoxical observations that have been seen in this adult stem-cell compartment. 
A step closer to a 
methanol economy 


A ruthenium catalyst has been developed that, at a few parts per million, releases 
hydrogen gas from methanol, a simple bulk chemical. The advance might allow 
methanol to be used as a hydrogen source for fuelling vehicles. SEE LETTER P.85 


DOUGLAS W. STEPHAN 


Fossil fuels are a worry. For example, the phenomenon of global warming is linked to 
the presence of atmospheric greenhouse gases arising from their use. Combustion of 
these fuels has provided convenient energy for centuries, and so there are also 
questions about the extent of remaining, accessible oil supplies. Combine these issues 
with environmental concerns, and with humans’ seemingly insatiable thirst for energy, 
and the result is a strong motivation to seek alternative energy sources. In this 
issue (page 85), Nielsen et al. report a dramatic advance that might pave the way to 
the practical use of one such alternative: hydrogen gas derived from the solvent 
methanol*. 

The use of ‘traditional’ alternatives to fossil fuels, such as hydroelectricity and 
nuclear power, has increased, and technologies that exploit nature’s wind, solar and 
tidal power are also being developed to supplement the energy needs of stationary 
users. For transportation applications, the advent of hybrid technologies (such as 
cars that use electric motors alongside petroleum-driven combustion engines) may 
reduce fossil-fuel consumption. However, a longer-term alternative to fossil fuels has 
been the subject of speculation by scientists, business leaders and futurist 
visionaries alike. 


*This article and the paper under discussion were 
published online on 27 February 2013. 

A leading candidate for a ‘clean’ energy 
source is hydrogen, because the only product 
of its combustion is water. This would seem 
to offer a solution to energy, pollution and 
greenhouse-gas problems in one fell swoop. 
But there are downsides. Public acceptance of 
hydrogen-based technologies is undermined 
by the gas’s flammable nature — a fact indel- 
ibly linked to the explosion of the Hindenburg 
passenger airship in 1937. Finding a way to 
store large quantities of hydrogen in a portable 
volume is also a challenge. And even if these 
issues were overcome, distributing hydrogen 
to the millions of potential users would require 
massive new infrastructure. 

Nobel laureate George Olah and his colleagues have advocated methanol (CH₃OH) as an 
alternative fuel, and have proposed the development of a “methanol economy”. Perhaps 
the most compelling aspect of their vision is the fact that methanol is hydrogen-rich 
(12.6% by weight) and, because it is a liquid at ambient temperatures, could be easily 
distributed using the existing infrastructure for petroleum. The development of 
chemical methods for extracting hydrogen from methanol is crucial to the success of 
this strategy. Currently, the best systems use platinum-based catalysts operating at 
temperatures above 200 °C, which limits the efficiency of methanol-based fuel cells to 
about 40%. 
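
The 12.6% figure follows directly from the formula CH₃OH and standard atomic masses, as this short Python check shows. The helper names are illustrative and the atomic masses are rounded standard values. 

```python
# Standard atomic masses in g/mol, rounded to three decimal places.
ATOMIC_MASS = {"H": 1.008, "C": 12.011, "O": 15.999}

METHANOL = {"C": 1, "H": 4, "O": 1}  # CH3OH

def molar_mass(formula):
    """Sum of atomic masses weighted by the number of each atom."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())

def hydrogen_mass_fraction(formula):
    """Mass of hydrogen in the molecule divided by its total mass."""
    return ATOMIC_MASS["H"] * formula.get("H", 0) / molar_mass(formula)

print(f"methanol molar mass: {molar_mass(METHANOL):.3f} g/mol")
print(f"hydrogen content: {100 * hydrogen_mass_fraction(METHANOL):.1f}% by weight")
```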

Nielsen et al. now report ruthenium-based 
molecules that catalyse the highly efficient lib- 
eration of hydrogen from methanol in water 
at less than 100°C. The ruthenium catalysts 
ensure that, for each molecule of methanol and 
water consumed, three molecules of hydrogen 
are generated, along with one molecule of car- 
bon dioxide (Fig. 1a). The authors observed 
that the reaction is facilitated in strongly basic 
solution, which also sequestrates the CO₂ 
almost quantitatively as carbonate or formate 
salts. Using such strongly basic conditions, the 
researchers found that the ratio of hydrogen to 
CO₂ in the evolved gas is consistently greater 
than 500:1. 
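
The stoichiometry described here, CH₃OH + H₂O giving 3 H₂ + CO₂, can be checked for atom balance, and it also shows why the hydrogen yield per kilogram of methanol exceeds methanol's own hydrogen content: one-third of the hydrogen comes from the water. The sketch below is bookkeeping only and assumes complete conversion, which the paper does not claim for any practical system. 

```python
from collections import Counter

ATOMIC_MASS = {"H": 1.008, "C": 12.011, "O": 15.999}  # g/mol, rounded

METHANOL = Counter({"C": 1, "H": 4, "O": 1})
WATER = Counter({"H": 2, "O": 1})
HYDROGEN = Counter({"H": 2})
CARBON_DIOXIDE = Counter({"C": 1, "O": 2})

def total_atoms(side):
    """Add up atoms over (coefficient, formula) pairs on one side of a reaction."""
    atoms = Counter()
    for coefficient, formula in side:
        for element, count in formula.items():
            atoms[element] += coefficient * count
    return atoms

def molar_mass(formula):
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())

reactants = [(1, METHANOL), (1, WATER)]
products = [(3, HYDROGEN), (1, CARBON_DIOXIDE)]
balanced = total_atoms(reactants) == total_atoms(products)

# Grams of H2 released per kilogram of methanol, assuming complete conversion.
h2_per_kg_methanol = 1000.0 * 3 * molar_mass(HYDROGEN) / molar_mass(METHANOL)

print("balanced:", balanced)
print(f"H2 per kg methanol: {h2_per_kg_methanol:.0f} g")
```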

To optimize the reaction conditions, Nielsen 
and colleagues studied the effects of catalyst 
concentration, the ratio of methanol to water, 
the basicity of the solution, and temperature. 
In the best case, the authors found that as little 
as 1.8 parts per million (p.p.m.) of the cata- 
lyst produced hydrogen from a 9:1 mixture 
of methanol to water, using a high concentra- 
tion (8.0 moles per litre) of the base potassium 
hydroxide at 91°C. Under these conditions, 
approximately 2,700 equivalents of hydrogen 
per equivalent of the catalyst were liberated 
every hour. This ‘turnover frequency’ rose to 
4,700 equivalents per hour if pure methanol 
was used as the solvent, with just 1.6 p.p.m. 
of catalyst. However, for real-world applications 
in fuel cells, methanol-water mixtures 
and lower concentrations of base will 
be required. 

To test their catalysts in real-world condi- 
tions, Nielsen et al. decreased the ratio of 
methanol to water to 4:1, used a much lower 
concentration of a different base (0.1 moles per 
litre of sodium hydroxide) and increased the 
catalyst concentration (to 21 p.p.m.). Under 
these conditions, the initial catalyst-turnover 
frequency decreased to about 800 equivalents 
per hour. The researchers observed that the pH 
of the solution fell from 13 to 10 during the first 
4 hours of reaction and that the ratio of hydrogen 
to CO₂ in the produced gas changed from 
9:1 to 3:1 over the same period. However, the 
composition of the gas mixture then remained 
stable for up to a further 3 weeks of operation, 
equivalent to a remarkable 350,000 turnovers 
of hydrogen from the catalyst. 
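
The turnover figures in this paragraph can be related to one another with simple arithmetic. The Python sketch below assumes that "3 weeks" means exactly 504 hours and ignores the first 4 hours, so the average it gives is only indicative. 

```python
HOURS_PER_WEEK = 7 * 24

def turnover_number(frequency_per_hour, hours):
    """Total turnovers if the catalyst ran at a constant frequency."""
    return frequency_per_hour * hours

def average_frequency(total_turnovers, hours):
    """Mean turnovers per hour over the whole run."""
    return total_turnovers / hours

run_hours = 3 * HOURS_PER_WEEK     # assumption: three full weeks
initial_frequency = 800            # equivalents of H2 per hour, from the text
reported_turnovers = 350_000       # from the text

ceiling = turnover_number(initial_frequency, run_hours)
mean_frequency = average_frequency(reported_turnovers, run_hours)

print(f"turnovers at a constant 800 per hour: {ceiling:,}")
print(f"mean frequency implied by 350,000 turnovers: {mean_frequency:.0f} per hour")
```

On these assumptions the implied mean rate is somewhat below the initial 800 equivalents per hour, as would be expected if activity declines modestly over the run. 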

The authors did not test their catalyst in methanol fuel cells, but their findings 
suggest that the catalysts will improve the energy capacity of such cells. In the 
short term, this augurs well for applications such as portable electronic devices. For 
transportation applications, however, the catalyst-turnover frequencies will have to 
rise substantially. 


Figure 1 | Simplified mechanism for the catalytic generation of hydrogen from methanol. a, Nielsen 
et al. describe ruthenium complexes that catalyse the dehydrogenation of methanol (CH₃OH) in basic 
solution, yielding three equivalents of H₂ and an equivalent of CO₂. b, The active catalyst is generated 
from a catalyst precursor when the base removes hydrogen chloride (HCl) from the complex. Methanol 
forms a complex with the active catalyst and then loses a molecule of hydrogen. In the presence of water, 
a gem-diol(ate) complex forms. This loses another hydrogen molecule, forming a formate complex, 
which loses a third hydrogen molecule and a molecule of carbon dioxide, regenerating the active catalyst. 
Ru, ruthenium; R can be phenyl (C₆H₅) or isopropyl (CH(CH₃)₂). Broken lines indicate interatomic 
bridging interactions through hydrogens. 

The active form of the catalyst is generated in situ from a precursor complex in which 
ruthenium is bound by the nitrogen and phosphorus atoms of a ‘tridentate’ ligand 
molecule, HN(CH₂CH₂PR₂)₂ (where R is either a phenyl group, C₆H₅, or an isopropyl 
group, CH(CH₃)₂). The ruthenium is also bound by a hydrogen atom, a chloride ion and 
a molecule of carbon monoxide. Nielsen and co-workers probed the mechanism of action of 
their catalyst, and found that the dissolved base removes a proton (H⁺) from the 
nitrogen of the tridentate ligand to generate the active catalyst (Fig. 1b). This 
nitrogen and the ruthenium atom then interact with methanol to liberate a hydrogen 
molecule (H₂), transiently generating formaldehyde (O=CH₂). In the presence of water, 
the formaldehyde forms a gem-diol(ate) species (CH₂(OH)O⁻), and then becomes a formate 
intermediate (HCO₂⁻) by losing another hydrogen molecule. Loss of CO₂ and of a third 
hydrogen molecule from the formate and the nitrogen of the ligand on ruthenium 
regenerates the active form of the catalyst. 
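
The three dehydrogenation steps can be summarized as a set of balanced equations. The scheme below omits the ruthenium complex entirely, and the way hydroxide and water are assigned to individual steps is an assumption made here to keep each line balanced; it is not a statement of the mechanism established by Nielsen et al. 

```latex
\begin{align*}
\mathrm{CH_3OH} &\longrightarrow \mathrm{CH_2O} + \mathrm{H_2} \\
\mathrm{CH_2O} + \mathrm{OH^-} &\longrightarrow \mathrm{CH_2(OH)O^-} \\
\mathrm{CH_2(OH)O^-} &\longrightarrow \mathrm{HCO_2^-} + \mathrm{H_2} \\
\mathrm{HCO_2^-} + \mathrm{H_2O} &\longrightarrow \mathrm{CO_2} + \mathrm{H_2} + \mathrm{OH^-} \\[4pt]
\text{net:}\quad \mathrm{CH_3OH} + \mathrm{H_2O} &\longrightarrow \mathrm{CO_2} + 3\,\mathrm{H_2}
\end{align*}
```

Adding the four steps cancels formaldehyde, the gem-diolate, formate and hydroxide, leaving the overall reaction of one methanol and one water giving three hydrogen molecules and one carbon dioxide. 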


Nielsen and colleagues’ work is a seminal 
finding, because it demonstrates the viability 
of a soluble catalyst for the efficient and long- 
lived generation of hydrogen from methanol. 
Chemists know a great deal about the design 
and optimization of molecular catalysts, and so 
they will undoubtedly embark on further stud- 
ies aimed at improving the catalytic activity of 
ruthenium complexes. They could thereby dis- 
cover innovations that will bring us even closer 
to the methanol economy. 
The units of a song 


Exactly when motor-planning neurons function to produce a bird’s song is 
debatable. New data suggest that bursts of activity in these cells mark sudden 
changes in the commands to the vocal organ. SEE ARTICLE P.59 


TODD W. TROYER 


What is the basic unit of speech? The 
word? The syllable? The phoneme? 
This question has been vexing 
speech and language researchers for decades, 
and similar questions have challenged those 
who study songbirds. Whereas behavioural 
evidence’ supports the grouping of songs into 
100-250-millisecond vocalizations called syl- 
lables, neurophysiological data suggest’ that 
the premotor areas at high levels in the hier- 
archy of motor neurons in the brain act more 
like a clock, providing a continuous stream 
of activity on a 10-millisecond timescale. On 
page 59 of this issue, Amador et al.’ reconcile 
these data by providing evidence that the song 
code generated by motor neurons of zebra 
finches (Taeniopygia guttata) is indeed broken 
into discrete ‘gestures’, which are significantly
shorter than song syllables*. 

The study has roots in two research programmes that started at opposite
ends of the motor-coding problem. One group studied the highest levels of
the motor system, in which sensory signals about a song’s acoustics
change the song motor program during learning. The researchers discovered
that, for every rendition of the bird’s song, individual neurons produce
short bursts of activity with incredible regularity and precision. They
also demonstrated a remarkable correspondence between the motor activity
that was recorded when the bird was singing and the auditory activity
that resulted from playing the bird’s song back to it when it was asleep.

The other team investigated how sound is generated by the avian vocal
organ, the syrinx. They developed a simplified biophysical model of the
syrinx with two dynamic parameters: the pressure in the bird’s air sac
and the spring-like tension on a vibrating membrane controlled by the
muscles surrounding the syrinx. Analysis of the model showed that small
changes in pressure and tension can lead to output that is a passable
imitation of the sounds produced by several species of songbird. This
work also suggested that, to sing, birds may not need precise control
over a large ensemble of muscles. Rather, two basic signals may suffice,
as long as the signals are controlled in a temporally precise manner.

Combining their previous approaches, the two research programmes now come
together. Amador et al. focus on a high-level cluster of neurons called
the HVC, which is essential for singing but, in terms of synaptic
connections between neurons, is the most distant from the syrinx. They
recorded the activity of individual HVC cells either while the birds sang
or during playback of the bird’s own song while it slept. They also tuned
the syrinx model to reproduce each bird’s song. By defining a vocal
gesture as a period of time when both the pressure and tension parameters
were either unchanged or strictly increasing or decreasing, they could
divide the song into a sequence of distinct gestural units.

On aligning the neural and behavioural data, the authors found that
activity bursts in HVC neurons occurred at specific time points in the
song, namely at the boundaries between gestures. The results suggest that
the gesture, which is longer than a burst but shorter than a syllable,
might be the basic unit of song production.

This finding contrasts with the reigning view of the motor code for
birdsong that was originally developed to account for the precise
bursting activity of HVC neurons (Fig. 1). Finding no clear relationship
between burst timing and the division of song into syllable-based units,
researchers proposed that the HVC acted more like a clock: bursting in
one set of HVC neurons triggered a burst in the next set, forming a
continuous set of ‘ticks’ throughout the song.

Although the clock and gesture hypotheses lead to different views of the
motor code for song, it is entirely possible that whereas bursting
activity in HVC neurons tends to align with gesture transitions, a
sufficient number of HVC neurons is active throughout each gesture to
sustain clock-like functionality. Because ruling out this variation on
the clock hypothesis would require demonstrating a negative (that there
are no HVC neurons active during gestures), the debate over the status of
the two hypotheses will probably linger for some time.

Amador and colleagues’ results also contain a deeper mystery, the
resolution of which may yield insight into how a bird learns its song
(Fig. 2). The mystery stems from their observation that the average delay
between an HVC burst and its associated gesture transition was near zero
milliseconds. However, neural signals in the HVC must be relayed through
several stages before they can alter the contraction of respiratory and
syringeal muscles, a process estimated to take 20 milliseconds. Thus, the
bursts recorded during singing occur too late to actually cause gesture
transitions. Similarly, the sound signal that arrives at the bird’s ears
has to traverse several synapses, causing an estimated delay of
15 milliseconds, before a sensory


*This article and the paper under discussion were published online on
27 February 2013.

Figure 1 | The clock and gesture hypotheses. In the premotor cluster of
HVC neurons, which is essential for singing, each neuron produces a
single burst of activity (bars) precisely locked to the song output.
Recordings are possible from only a few neurons (red) in any given bird.
a, It was proposed that the unrecorded neurons (open) are continuously
active throughout the song, acting like a clock to pace the song output.
b, By building a model of the bird’s vocal organ, Amador et al. produce a
new set of ‘sheet music’ for the song that specifies the motor commands
needed to make any given sound. They find that every burst they recorded
fell near a transition point between gestures (start times for notes in
the sheet), suggesting that song is encoded as a series of distinct
units.

representation of it is registered in the HVC. This means that the bursts
recorded during sleep, which align to sound with a zero-millisecond
delay, occur too early to be caused by the auditory detection of a
gesture transition.

Although we cannot yet expect definitive answers to the question of how
high-level motor representations determine the control signals for song
production, the syringeal-modelling approach pursued by Amador et al.
provides both a method for breaking the song down into its basic units
and evidence that HVC bursts are related to specific events in a bird’s
song. With a better understanding of the basic units, these results
provide a foundation for understanding how birds learn to string these
pieces back together to produce a whole song.

Figure 2 | The singer under investigation. The zebra finch Taeniopygia guttata.
ALEXANDRAPHOTOS/FLICKR/GETTY IMAGES

A fable of too
much too fast 


A bacterium and a fungus both use gene sequences that fail to optimize the 
production of circadian-clock proteins. Two studies reveal different reasons for 
the advantages of producing less protein. SEE LETTERS P.111 & P.116


JENNIFER M. HURLEY & JAY C. DUNLAP 


This is a tale of two organisms from two kingdoms that hail from very
different habitats and that have nothing in common other than their
ability to keep time, albeit using wholly distinct circadian systems. In
this issue, Xu et al. (page 116) and Zhou et al. (page 111) respectively
studied a freshwater cyanobacterium and a fungus. What these studies
share is the fact that each organism has exploited the phenomenon of
codon-usage bias to produce less of certain proteins and thereby build a
more useful circadian clock. This modern fable features model organisms
used in clever ways to reveal something new and confirm something old,
and it reminds us that the goal of evolution is not necessarily to do
more of something, but to do it better*.

First the cast of characters. The cyanobacterium is Synechococcus
elongatus, in which three proteins, KaiA, KaiB and KaiC, self-assemble
into a feedback loop that can keep time without the need for additional
gene transcription or protein translation. The fungus is Neurospora
crassa, an exemplar of the circadian clocks found in fungi and animals
that operate using transcription-translation-based feedback loops. In
these systems, heterodimeric transcription factors drive the expression
of one or more ‘negative-element’ proteins; after a lag, these negative
elements turn down the activity of their transcriptional activator. The
Neurospora heterodimeric activator is formed from WC-1 and WC-2, and the
negative element is FRQ, a protein that can be inactivated by
phosphorylation at more than 100 amino-acid residues, consistent with its
long-predicted propensity for inherent structural disorder.

*This article and the papers under discussion were published online on
17 February 2013.

The two research groups respectively 
assessed codon usage in the genes encoding 
KaiB-KaiC (KaiBC) and FRQ (codons are the 
three-nucleotide messenger RNA sequences 
that code for different amino acids). They 
found that both genes lack strong codon- 
usage biases, meaning that the codons used 
to encode the amino acids are not those that 
would optimize abundant protein expression. 
The authors demonstrate that this lack of bias 
is advantageous to the organism in both cases, 
although for different reasons. 

Codon-usage bias stems from the fact that 
there is redundancy in the process of protein 
coding. Twenty standard amino acids are 
used to form proteins, but because the three- 
letter codon code gives 64 possible combina- 
tions (the four nucleotide bases taken three at 
a time), there is the potential for more than 
one codon for each amino acid. Transfer RNA 
(tRNA) molecules recognize codons and carry 
the appropriate amino acid to the ribosome 
(the cellular machine that synthesizes pro- 
teins), and tRNAs can be expressed at differ- 
ent levels. It is thought that highly used codons 
correlate with abundant matching tRNA mol- 
ecules, and that genes evolve to optimize the 
efficiency of protein translation on the ration- 
ale that faster protein synthesis yields more 
protein and that this is good. According to this 
concept, the translation rate depends on the 
codons used by the encoding gene, and in vitro 
studies suggest that this can not only affect the 
amount of protein made, but also influence 
the process of folding a polypeptide chain into 
the correct three-dimensional protein struc- 
ture’. But until now, few studies have pro- 
vided in vivo demonstrations of the biological 
significance of non-optimal codon usage. 
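
The redundancy described above can be checked with a few lines of Python. This is only an illustrative sketch: the compact amino-acid string is the standard genetic code written in one-letter symbols (with `*` for stop codons), and the names `BASES`, `AMINO_ACIDS`, `CODON_TABLE` and `synonymous_counts` are introduced here for illustration and do not come from either study.

```python
from collections import Counter
from itertools import product

# RNA bases in the conventional U, C, A, G order used for codon tables.
BASES = "UCAG"

# Standard genetic code, one letter per codon in UCAG order
# (first base varies slowest, third base fastest); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Four bases taken three at a time give 4**3 = 64 codons.
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_counts(table):
    """Count how many codons encode each amino acid (stop codons excluded)."""
    return Counter(aa for aa in table.values() if aa != "*")


counts = synonymous_counts(CODON_TABLE)
print(len(CODON_TABLE), len(counts))  # prints: 64 20
print(counts.most_common(3))          # the most redundant amino acids
```

Because 61 sense codons are shared among 20 amino acids, most amino acids have several synonymous codons, and it is the choice among these synonyms that codon-usage bias describes.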

Xu et al. and Zhou et al. sought to under- 
stand the surprising finding that neither kaiBC 
nor frq use optimal codons, by expressing ver- 
sions of the genes in which all or some of the 
codons were optimized. As expected, in both 
cases the optimized versions made more protein,
but in neither case was ‘more protein’
actually more useful.

Zhou and colleagues found that FRQ that was expressed from a gene in
which either rare codons or all codons near the amino terminus of the
protein were optimized could not function in the Neurospora clock despite
the presence of adequate cellular FRQ levels. Further probing revealed
that this forced codon usage induced constant, rather than rhythmic, FRQ
expression, and also led to hyperphosphorylation of the protein and
impaired interactions with WC-2. This ‘optimized’ FRQ also seemed to be
more sensitive to degradation by protease enzymes, leading the authors to
suggest that codon optimization in frq leads to an increased translation
rate that affects protein folding and that thereby changes FRQ structure
(Fig. 1). Interestingly, optimizing codon usage at a region that encodes
the middle of FRQ and that contains many phosphorylation sites was also
disruptive, yielding arrhythmic protein expression and FRQ that displayed
altered protease sensitivity and inadequate phosphorylation, again
indicating that proper protein folding was not attained. The reasonable
conclusion from these findings is that the non-optimal codon usage for
the wild-type version of this gene in Neurospora is a mechanism to allow
the proper co-translational folding of FRQ by reducing the translation
rate.

Working with Synechococcus, Xu et al. also found that optimizing codon
usage in kaiBC led to the cells forming more protein. Surprisingly,
however, codon optimization produced a more robust clock than seen in the
wild type, especially when the bacteria were grown at low temperatures
(which they may normally encounter). So why has the organism not evolved
to optimize codon usage, if this produces a more robust clock across
different temperatures? The authors show that, despite its weaker rhythm,
the wild-type strain grew better under light-dark cycles (to mimic
natural growing conditions) than the codon-optimized strain at low
temperatures.

The explanation for this goes back to circadian first principles: clocks
are evolutionarily advantageous because they help to coordinate
activities in organisms such that they happen at appropriate times of day
(for an example, see ref. 8). In this case, the bacterium’s natural
rhythm at low temperature can be as long as 30 hours. Circadian-entrainment
theory and practice both show that when a 30-h clock is entrained to a
24-h light-dark cycle, it must do so with a significantly later phase
angle, so late that clock-regulated activities are driven to later and
inappropriate times. Thus, the temperature conditionality of the
wild-type system, in which rhythms are weakened or lost at low
temperatures, is good: no clock is better than a maladaptive clock
(Fig. 1). This may suggest a rationale for the conditional loss of
rhythms previously reported in other organisms unable to control their
internal body temperatures, including cyanobacteria, fungi,
dinoflagellates and higher plants.

These findings suggest that the lack of codon optimization in the two
organisms probably reflects two different factors. In Synechococcus,
kaiBC mRNA is extremely abundant and KaiBC is a highly structured
protein, so codon optimization has its impact on the amount of functional
protein that is made. By contrast, frq mRNA in Neurospora is rare and FRQ
is predicted to be disordered, so the choice of codons is aimed at
producing a limited amount of protein with the correct structure. But in
both cases, the moral of the story is: less can be more, and quality is
more important than quantity.

Figure 1 | Balancing quality and quantity. Ribosomes are cellular
machines that produce proteins by translating messenger RNA codons of
three nucleotides into the corresponding amino acids. Some codons are
known to facilitate more-rapid protein production than others, but it
seems that producing more protein more rapidly is not always
advantageous. a, Zhou et al. and Xu et al. find that both the fungus
Neurospora and the cyanobacterium Synechococcus adopt non-optimal codon
usage for their circadian-clock genes (frq and kaiBC, respectively). This
produces normal amounts of correctly folded protein. b, Zhou et al. show
that, for production of FRQ protein in Neurospora (assuming that the
initiation of protein translation is the limiting step and that protein
elongation then proceeds rapidly), forced codon optimization causes its
mRNA to be translated so fast that protein folding fails, resulting in a
misformed and non-functional protein that does not support a rhythm.
c, By contrast, Xu et al. show that, in Synechococcus, optimized codon
usage in the kaiBC gene leads to a more robust rhythm and enhanced
clock-protein production (also assuming limiting initiation and rapid
elongation). Surprisingly, at low temperatures, the bacterium actually
grows better using the non-optimized codon usage that yields lower
clock-protein production and a weakened clock. Thus, it seems that both
organisms have evolved non-optimal codon usage to produce a reduced
number of properly folded clock proteins that are adaptive to natural
conditions. Panel labels: normal amount of correctly folded protein;
incorrectly folded protein; too much correctly folded protein.
Elemental gesture dynamics are encoded
by song premotor cortical neurons


Ana Amador, Yonatan Sanz Perl, Gabriel B. Mindlin & Daniel Margoliash


Quantitative biomechanical models can identify control parameters that are used during movements, and movement 
parameters that are encoded by premotor neurons. We fit a mathematical dynamical systems model including 
subsyringeal pressure, syringeal biomechanics and upper-vocal-tract filtering to the songs of zebra finches. This 
reduces the dimensionality of singing dynamics, described as trajectories (motor ‘gestures’) in a space of syringeal 
pressure and tension. Here we assess model performance by characterizing the auditory response ‘replay’ of song 
premotor HVC neurons to the presentation of song variants in sleeping birds, and by examining HVC activity in 
singing birds. HVC projection neurons were excited and interneurons were suppressed within a few milliseconds of 
the extreme time points of the gesture trajectories. Thus, the HVC precisely encodes vocal motor output through activity 
at the times of extreme points of movement trajectories. We propose that the sequential activity of HVC neurons is used as 
a ‘forward’ model, representing the sequence of gestures in song to make predictions on expected behaviour and
evaluate feedback.


For a given set of movements, sets of movement parameters tend to be 
correlated with each other, so that it is difficult to resolve whether motor 
cortical neurons encode different sets of static parameters (for example, 
position, velocity and direction), or even to distinguish between static 
and time-dependent parameters (for example, path trajectory)’. In 
principle, the motor coding problem can be addressed by developing 
quantitative models that describe the biomechanics of the movements’. 
To the extent that such models capture the actual control elements used 
to produce a movement, this permits motor cortical neuron activity to 
be evaluated in a natural framework. We examined motor control in the 
avian song system from this perspective, creating a dynamical systems 
model of the avian vocal organ (syrinx) that captures many of the rich 
set of vocal behaviours that characterize bird songs’. 

We assessed predictions of the biomechanical model by taking 
advantage of a neuronal replay phenomenon* ®. Neurons in the nuc- 
leus HVC, a secondary motor or association cortex structure (among 
the structures that are essential for singing, this is the most synapti- 
cally distant from the syrinx), emit precise premotor activity when a 
bird sings’. Interestingly, when a bird listens to playback of its own 
song, the same HVC neurons have responses that are very similar in 
timing and structure to their responses during singing®, and that are 
highly selective for the bird’s own song (BOS)*’. In zebra finches, 
there is a notable state-dependent neuronal replay phenomenon* 
associated with song learning", so that the strongest and most selec- 
tive auditory responses are recorded in sleeping birds. We used the 
responses of HVC neurons in sleeping adult zebra finches to evaluate 
the responses to BOS and artificial BOS variants, and then tested
emerging hypotheses by recording from singing birds. 


Validating a song model and estimating static parameters 


The avian vocal organ is a nonlinear device that is capable of
generating complex sounds even when driven by simple instructions.
We extended a low-dimensional model of the avian syrinx
and vocal tract that can capture a variety of acoustic features such as
the precise relationship between fundamental frequency and spectral
content of zebra finch song. The model used here is summarized
in Fig. 1. A two-dimensional set of equations describes the labial
dynamics (Fig. 1; see Methods). Flow fluctuations are fed into a vocal
tract, generating an input sound (Pᵢₙ(t)). The tract filters the sound
and is characterized as a trachea, modelled by a tube, which connects
to the oro-oesophageal cavity (OEC), modelled here as a Helmholtz
resonator (see Methods). The output of the model is a time trace
representing the uttered sound (Pₒᵤₜ(t)).

Using this model, we created synthetic versions of the songs that 
our test birds sang. Time-dependent parameters of the model describ- 
ing the labial dynamics were reconstructed to account for the time- 
dependent acoustic properties of the sound (see Methods). Following 
procedures described previously, for each bird’s song we used an
algorithmic procedure to reconstruct unique functions for the air-sac
pressure (α(t)) and the tension of syringeal labia (β(t)). The result of
the procedure for one song is illustrated in Fig. 2, showing that many 
features observed in the spectrograph of the recorded song (Fig. 2a) 
were also present in the synthesized song (Fig. 2b). Relatively simple 
time traces of reconstructed pressure and tension arose from fitting 
the bird’s song (Fig. 2c). These two functions drove the nonlinear 
equations for the labia to produce a wide range of diverse acoustic 
features. The parameter space of pressure versus tension was orga- 
nized by bifurcation curves (Fig. 2d); curves in the parameter space 
that separated regions in which the model presented qualitatively 
different dynamics (sound patterns). Only one region (Fig. 2d) cor- 
responded to oscillatory behaviour in which labial oscillations 
resulted in sound-pressure fluctuations. Two features of the pressure- 
tension trajectories resulting in sound output were apparent (Fig. 2d). 
First, most of the control parameters were maintained close to 
bifurcation curves, facilitating rapid changes in the quality of sound 
output with small changes in parameter values; and second, there 
were many sounds that were characterized principally by movements 
in pressure or tension, but not both. 

Figure 1 | Schematized view of a dynamical systems model describing
syringeal labial dynamics and tracheal vocal-tract filtering. The syringeal
membrane was modelled as a mass (m) with damping (b) and a restitution
(spring) force (k). Normal form equations for labial position (x(t), red line)
were integrated, computing the input pressure at the vocal tract (Pᵢₙ(t), green
line), filtered by the oro-oesophageal cavity (OEC), and ultimately the total
output pressure (Pₒᵤₜ(t), blue line). γ, time constant; r, reflection coefficient of
the trachea; T, propagation time along trachea; v, proportional to the mean
velocity of the flow; y, velocity (see Methods). Figure label: OEC, modelled as a
Helmholtz oscillator.

Song was described by the sequence of these pressure–tension
trajectories, which we call gestures, with gesture onsets and offsets
defined as discontinuities in either the pressure or tension functions
(Fig. 2c). Gestures include movements that do not result in phonation,
such as pressure patterns associated with mini-breaths between
syllables, but our recordings were limited here to airborne sounds.
In a sample of 8 modelled songs, there were 13 ± 4 gestures per motif
(largest basic unit of song, a repeated sequence of syllables). The
distribution of gesture durations (mode = 22.5 ± 2.5 ms, range
4–142 ms) was non-Gaussian, with 33% of the gestures lasting less
than 30 ms, and it had a long tail corresponding to slowly varying
sounds, such as constant-frequency harmonic stacks (Fig. 2e).

This simple model captured the essential features of sound production
in a framework of labial tension and subsyringeal pressure over
which birds have direct motor control. Although the syrinx has
considerable additional complexity, the model described the vocal
system in a low-dimensional space, enabling us to capture a wide range
of acoustic features using a small set of time-dependent parameters.

We tested the model by comparing responses of HVC neurons to
the broadcast of the modelled song (mBOS) with the BOS in sleeping
birds (Fig. 3). Responses to a grid of mBOS stimuli with identical
timing but different spectra from BOS identified optimal estimates
for two remaining free static parameters (Supplementary Fig. 1). In
sleeping birds, song-system neurons are exceptionally selective and it
was difficult to induce a response: for example, mBOS generated
without the OEC component failed to elicit responses. On one occasion
we mis-estimated the duration of a component of BOS by 5 ms,
and found one neuron that responded strongly to the BOS but not at
all to the synthetic song (Supplementary Fig. 2b). Over a population of
30 neurons, the best mBOS elicited 58% ± 8% of the response to BOS
(Supplementary Note 1). Phasic projection neurons (HVCₚ; n = 15)
and tonic interneurons (HVCᵢ; n = 15) both responded selectively to
mBOS over non-BOS stimuli (Supplementary Note 1). These results
show that a low-dimensional model representing an approximation of
peripheral mechanics is sufficient to capture behaviourally relevant
features of song.


Projection neurons burst at gesture extrema

We next evaluated the activity of HVC neurons relative to model
dynamics, by analysing the timing of spike bursting relative to the
pressure–tension trajectories used to synthesize the mBOS. This identified
a compelling relationship between the timing of HVCₚ spikes and
the pressure–tension trajectories. For example, in Fig. 4a the spiking of
two neurons (coded with different colours) is shown relative to the BOS
spectrograph, oscillograph and reconstructed pressure and tension
time series. One of the neurons burst once, at the transition between
descending frequency modulations and a constant frequency ‘high
note’. The other neuron burst twice, once when the pressure during a
high note reached a maximum, and once at the transition between a
high frequency chevron and a broadband frequency-modulated sound.
Similar relationships between spike-burst timing and gestures were
seen for 14 of the 15 HVCₚ (Supplementary Figs 2 and 3). In one case,
a neuron emitted bursts in the interval between syllables. We propose
that this pattern might arise if the bursts are associated with
mini-breaths during singing. Only the 17 bursts occurring during
phonation were considered for further analysis.

Examination of the responses of the HVCₚ using pressure versus
tension plots showed that neurons burst preferentially at
gesture-trajectory extrema (GTE) associated with gestures (Fig. 4b). A gesture
has at least two GTE, at its beginning and end, and up to two additional
GTE, if the absolute maxima of pressure and/or tension represent
unique and distinct time points. In cases in which the absolute
maximum is not distinct in time, no additional GTE result (for
example, if there are multiple local maxima with the same magnitude).
Of the 17 bursts (14 HVCₚ), 11 (65%) were aligned with
onsets or offsets, and 6 (35%) were aligned with pressure or tension
maxima. In a sample of 5 songs, there were 28 ± 4 GTE per song
(165 GTE in total). From a total of 60 gestures, 20 (33.4%) had only
onset and offset GTE; in addition, 30 (50%) had a unique peak in
pressure (3 GTE per gesture), 5 (8.3%) had a unique peak in tension
(3 GTE per gesture); and 5 (8.3%) had unique peaks in both pressure
and tension. The distribution of time intervals between successive
GTE (mode = 9 ± 1 ms, range 4–116 ms) was non-Gaussian, and
66% of the intervals had a duration of less than or equal to 30 ms
(Fig. 4c). Most gestures corresponded to notes (the smallest unit of
song organization recognized by ornithologists), but motor activity at
GTE maxima could subdivide notes; for example, at the point at which
a neuron burst and the pressure reached a maximum in the middle of
a constant-frequency harmonic stack (Supplementary Fig. 2). These
examples highlight that for some HVCₚ, the patterns of activity would
not be interpretable with a purely spectrographic analysis of song.
We also observed cases in which HVCₚ burst at the onset of relatively
pure pressure-only or tension-only trajectories, with a tendency
towards pressure-only trajectories (Fig. 2d). If such neurons project
to distinct regions of the HVC’s afferent targets, which are organized
based on the syringeal muscles and interactions with the respiratory
system, such an observation could help to resolve the long-standing
riddle of the HVC’s topographic organization. 
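
As a rough illustration of the GTE definition given above, the following Python sketch lists the GTE of a single gesture from sampled pressure and tension traces. It is a minimal reading of the text, not the authors' code: the function name `gesture_gte`, the sampled-trace representation and the exact-equality test for a repeated maximum are all assumptions made here.

```python
def gesture_gte(times_ms, pressure, tension):
    """Return the sorted GTE times of one gesture.

    Every gesture contributes its onset and offset. A pressure or tension
    maximum adds a further GTE only if it occurs at a single sample
    (unique) that is not already a GTE (distinct).
    """
    gte = {times_ms[0], times_ms[-1]}  # onset and offset
    for trace in (pressure, tension):
        peak = max(trace)
        peak_indices = [i for i, value in enumerate(trace) if value == peak]
        if len(peak_indices) == 1:  # maximum reached at one time point only
            gte.add(times_ms[peak_indices[0]])  # the set ignores duplicates
    return sorted(gte)


# Hypothetical gesture: a single pressure peak at 10 ms, flat tension.
times = [0, 5, 10, 15, 20]
print(gesture_gte(times, [0, 1, 3, 1, 0], [2, 2, 2, 2, 2]))  # [0, 10, 20]

# Tally implied by the counts reported in the text: gestures with 2, 3 or 4 GTE.
gestures_by_gte_count = {2: 20, 3: 30 + 5, 4: 5}
total_gestures = sum(gestures_by_gte_count.values())
total_gte = sum(n_gte * n for n_gte, n in gestures_by_gte_count.items())
print(total_gestures, total_gte)  # 60 165
```

Counting GTE gesture by gesture in this way gives 165, which agrees with the total quoted in the text.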
Figure 2 | A low-dimensional model for reconstructing gestures.

a–e, Spectrographs of a bird’s song (a) and a model synthetic song (b). Song is
described by fitted parameters α(t) and β(t), proportional to air-sac pressure
and labial tension, respectively (c; each distinct sound is colour coded). Each
sound (same colour code as in c) is generated by a continuous curve in the
parameter space of the model, that we define as a ‘gesture’ (d). Oscillations in 
the vicinity of a Hopf bifurcation are almost tonal, whereas oscillations near a 
saddle-node (SN) bifurcation present rich spectra, typical of zebra finch song. 
Note that the spectrally poor ‘high note’ (green) is distant from the SN 
bifurcation. The grey area indicates the region of phonation. The distribution of 
gesture durations for five birds is displayed in e. a.u., arbitrary units. 


Figure 3 | Testing the low-dimensional model. The activity of HVC-selective
neurons of sleeping birds in response to the presentation of BOS and mBOS was
similar. The timing of the three repeated motifs that were presented is indicated
by the bold horizontal lines.

To quantify these observations, we calculated the time between
each spike in each burst to the closest GTE for all 17 bursts. The
resulting distribution was approximately Gaussian and, on average,
the bursts preceded the closest GTE (mean = −5.6 ± 0.3 ms, σ =
6.7 ± 0.3 ms; Fig. 4d). A bootstrap procedure (Supplementary Note 2)
confirmed that the correspondence to the closest GTE was statistically
significant (F-test, P < 0.045). This indicates that the timing of HVCₚ
bursts is associated with the timing of GTE. Given a minimal delay
between activity of HVCₚ and sound production estimated at
25–50 ms, the minimal 15-ms delay for auditory feedback to HVC,
and that the duration of intervals between GTE varied greatly (Fig. 4c), it
is notable that the timing of HVCₚ bursting was synchronized with a
near-zero time lag to a model of actual behavioural output. 
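
The lag measurement described above can be sketched in Python as follows. This is a hedged illustration only: the authors' bootstrap is specified in Supplementary Note 2, which is not reproduced here, so the null model below (events placed uniformly at random within the motif, compared by the spread of their lags) is an assumption, as are the names `signed_lag_to_nearest`, `lag_spread_null` and `p_value`.

```python
import bisect
import random
import statistics


def signed_lag_to_nearest(event_ms, gte_ms):
    """Event time minus the nearest GTE time (negative: event precedes the GTE).

    gte_ms must be sorted in ascending order.
    """
    i = bisect.bisect_left(gte_ms, event_ms)
    candidates = gte_ms[max(i - 1, 0): i + 1]  # neighbours on either side
    nearest = min(candidates, key=lambda g: abs(event_ms - g))
    return event_ms - nearest


def lag_spread_null(n_events, gte_ms, motif_ms, n_resamples=1000, seed=0):
    """Spread (population s.d.) of lags for randomly placed events.

    Assumed null model: each event falls uniformly within the motif.
    """
    rng = random.Random(seed)
    spreads = []
    for _ in range(n_resamples):
        lags = [signed_lag_to_nearest(rng.uniform(0, motif_ms), gte_ms)
                for _ in range(n_events)]
        spreads.append(statistics.pstdev(lags))
    return spreads


def p_value(observed_spread, null_spreads):
    """Fraction of resamples whose lags are at least as tight as observed."""
    hits = sum(s <= observed_spread for s in null_spreads)
    return (hits + 1) / (len(null_spreads) + 1)


# Hypothetical example: GTE every 100 ms and 17 bursts each 1 ms before a GTE.
gte = list(range(0, 1001, 100))
observed = [signed_lag_to_nearest(g - 1, gte) for g in gte[1:]] + [-1] * 7
null = lag_spread_null(len(observed), gte, motif_ms=1000, n_resamples=200, seed=1)
print(statistics.mean(observed), p_value(statistics.pstdev(observed), null))
```

A tighter-than-chance spread of lags around zero is what a timing relationship between bursts and GTE would look like under this simplified null; the real analysis may differ in both the resampling scheme and the test statistic.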


Interneurons are suppressed at GTE 


HVCᵢ show local minima and maxima in their tonic activity throughout
song, and we noted that there was a relationship between the
minima and the timing of GTE. To characterize this, for each interneuron
we binned the spikes in 10-ms windows for each acoustic
presentation. The resultant average response traces were smoothed
and the minima in the smoothed traces were identified (see Methods;
an example neuron is shown in Fig. 5a, bottom panel). Each HVCᵢ did
not have minima at all GTE, but across all neurons we observed a
close alignment between the times of the minima and the times of
GTE (a non-significant relationship was observed for maxima of
HVCᵢ activity; Supplementary Fig. 4). Computing the differences
between the time of each minimum that occurred during phonation
and the closest GTE resulted in a distribution that was approximately
Gaussian (mean = −0.82 ± 0.60 ms, σ = 7.3 ± 1.4 ms; Fig. 5b).
We compared this distribution to the distribution of randomly positioned
minima within each motif using the bootstrap procedure
and found them to be significantly different (F-test, P < 0.016;
Supplementary Note 2). Additional tests identified marginally significant
timing of minima relative to GTE for one of four birds (Supplementary
Note 3). Thus, the precise activity of HVCᵢ (ref. 7) can help to
shape the timing of HVCₚ. This suggests a simple model in which
bursts of activity of HVCₚ suppress activity in HVCᵢ, whose ongoing
activity helps to shape the next HVCₚ burst.


A representation of gestures during singing 

Given that our results were obtained by broadcasting songs to sleeping birds, it
is natural to enquire whether the activity of HVC neurons is also locked to
gesture transitions during singing. Previous results have shown similar patterns
of spike bursts between the daytime singing activity and the auditory-driven
responses during sleep of single neurons of the robust nucleus of the
arcopallium (RA) in zebra finches. HVC neurons in awake swamp sparrows and
Bengalese finches, which respond to auditory stimulation, also show
auditory-motor correspondence in their activity patterns. However, similar
observations have yet to be reported for zebra finch HVC neurons. We made
recordings from HVC in singing birds (n = 3 birds), including ten tonic neurons,
and five phasic neurons that burst during phonation, recorded in two of the three
birds (Fig. 6 and Supplementary Fig. 5); one phasic neuron had two bursts per
motif. These recordings confirmed that during singing, all sparse bursts of HVC_p
occurred at gesture transitions (Fig. 4e). The same analysis that was used in
sleeping birds was then used for singing birds (except that, as each motif of
song could vary, it was independently modelled here), and this showed that the
timing of HVC_p for singing birds was even more precise than during sleeping (see
Fig. 4d, e). The Gaussian fit for the population of phasic neurons recorded
during singing (mean = −1.35 ± 0.10 ms, σ = 4.0 ± 0.1 ms; Fig. 4e) was
significantly different from the bootstrapped random distribution (F-test,
P < 0.025; see Supplementary Note 2 and Supplementary Fig. 6). The minima in the
activity of tonic neurons recorded during singing also showed precise timing
relative to GTE (Gaussian fit for the minima: mean = −0.12 ± 0.4 ms,
σ = 4.0 ± 0.4 ms; Fig. 5c), and this was significantly different from the
bootstrapped random distribution (F-test, P < 0.002). Additional analyses showed
significant locking of minima to GTE in two out of three singing birds
(Supplementary Note 3). As for sleeping birds, the maxima of tonic neural
activity showed no evidence of a significant locking to the GTE (Supplementary
Fig. 4c). Finally, examination of data from a previous study of zebra finches
showed that during singing the timing of HVC_RA bursts was closely associated
with the timing of the bursts of the other class of HVC projection neurons, which
project to Area X (HVC_X), the basal ganglia component of the song system
(Supplementary Fig. 7). Our results support the hypothesis that all classes of
HVC neurons are active in relation to the timing of gestures, although the
multiple subtypes of HVC_RA, HVC_X and HVC_I have yet to be evaluated.


Figure 4 | Timing of gestures relative to bursting of projection neurons. a, Song
spectrograph and oscillograph (top two panels); reconstructed parameters of
pressure and tension (third and fourth panels; the gestures of interest are
coloured magenta, blue and cyan), with tick marks indicating the times of all
GTE; and raster plots of the responses of two neurons (bottom panel,
colours. b, The trajectories in pressure-tension parameter space, with a point
indicating the mean

It was thought previously that the timing of song syllables was unrelated to the
timing of HVC_p discharge in singing birds. Given the sparse bursting of these
cells, this led to the idea that the output of HVC had a clock-like function with
a nearly uniform ‘tick’ size of approximately 10 ms (ref. 23) supported by a
‘syn-fire’ chain of synaptic activity across HVC_p (ref. 5). Instead, we find
that the bursting of HVC_p and the modulation of HVC_I activity are timed to
particular time points of motor gestures. The sequential firing across the
population of HVC_p unfolds in an ordered fashion, but time is not explicitly
represented in HVC. Instead, the statistics of HVC activity are closely tied to
syringeal–vocal tract mechanics. Given the broad distribution of times between
GTE, if HVC activity is synchronized with GTE this is inconsistent with a
syn-fire network that is active at every moment. The differences between these
two models of HVC have additional broad implications for the functional
organization of the song system, for song learning and for motor coding.

As gestures vary greatly in duration, and as the RA only has access to the times
of GTE, downstream components of the motor pathway (the RA and presumably the
brainstem) should generate independent dynamical information to sustain the
detailed structure within each gesture (compare with refs 23, 25). Previous
experiments, including examinations of the effects of electrical stimulation of
HVC or RA during singing and lesions of nuclei afferent to the HVC, implicate
information in the HVC in the encoding of larger units of song. This might be
explained, for example, if gestures at the start of syllables are overemphasized
in HVC relative to gestures encoding mini-breaths preceding syllables. Finally,
gestures are learned, and this is consistent with the physiological properties of
HVC neurons: integration over hundreds of milliseconds and multiple syllables,
nonlinear summation over syllables in a sequence preceding the excitatory
response, and selective response to BOS. The information on groupings of
gestures, such as syllables, can be carried in these integrated signals. This
also re-emphasizes that synaptic modification in HVC, not just changes at HVC-RA
synapses, is associated with feedback-mediated sensorimotor learning (compare
with ref. 23). The HVC also projects to the cortico-basal ganglia pathway, which
contributes to learning-mediated synaptic modification in RA by introducing
variance into song output. This suggests the hypothesis that the variance is
structured not in an auditory framework but around specific features of song
motor gestures.


A forward model for vocal motor control

If activity in the HVC is synchronized, with little time lag, with motor gestures
occurring at the periphery, this would tend to bring it into temporal register
with fixed (circa 15 ms) delayed auditory, proprioceptive feedback. This allows
movements to be represented in the HVC by gestures of greatly varying duration
(with dynamics generated principally through internal HVC interactions), and with
each gesture referenced to a common time framework for evaluating feedback (with
feedback arriving through distinct, extrinsic inputs). This suggests that
projection neurons represent a prediction of the actual behavioural output at
that moment in time, constituting an unexpected form of a ‘forward’ or predictive
model to resolve the problem of the delay in sensorimotor control. Assuming that
behaviour is subdivided into gestures, and that only the transitions (GTE) are
represented by HVC output (HVC_p), feedback information could accumulate in the
intervals between the transitions by modifying the tonic activity of HVC_I and,
subsequently, the spike bursting of HVC_p. Indeed, HVC receives multiple sources
of feedback, including input from the primary motor cortex RA, thalamic input
carrying brainstem respiratory, auditory and proprioceptive information, and
forebrain auditory input.


Figure 5 | Suppressed interneuron activity is associated with GTE. a, Song
spectrograph and oscillograph, reconstructed parameters of pressure and tension,
and raster plots, organized as in Fig. 4a, but with spike count response to the
song (10-ms bin, 20 repetitions; green line) for one HVC_I, and a smoothed
measure of the response (black line; see Methods). Red squares indicate the time
of the minima in the smoothed measure, and the vertical lines indicate the
position of the closest GTE to each minimum. b, Distribution of time differences
between spike response minima (T_min) and their closest GTE (T_GTE) in sleeping
birds (n = 15 HVC_I, 5 birds). c, Same analysis as b but for singing birds
(n = 10 HVC_I, 3 birds).

Figure 6 | During singing, HVC projection neurons fired in the vicinity of GTE.
a, An HVC_p neuron with spike bursting that is time-locked to the vicinity of a
GTE, even as the syllable sequence and time interval vary. Letters over the
sonograph identify the syllable types. b, For another bird, the bursts of an
HVC_p neuron are locked to a GTE in the vicinity of a subtle acoustic transition.

We have described song organization based on gestures, using the dynamical
systems modelling framework to replace analysis of songs based on spectrographs.
These features of motor systems organization may be represented in other systems
and for other behaviours. Our data support Sherrington’s long-standing hypothesis
that the motor cortex is a synthetic organ, representing segments of whole
movements. In humans, the production of speech and the performance of athletes
and musicians are exceptional examples of highly precise learned skilled
behaviour that could have similar mechanisms to those described here. The
development of corresponding models for human speech production should help to
provide insight into speech and language pathologies in which sequential
behaviour is disrupted.


METHODS SUMMARY 


Songs were recorded from 12 birds and electrophysiology was conducted on
9 adult male zebra finches (Taeniopygia guttata) bred in our colony. Birds were
prepared for HVC extracellular recordings using standard techniques, by
implanting a head pin (auditory experiments) or motorized microdrive (singing
experiments). Recordings were post-processed with a spike-sorting algorithm
(Klusters, http://klusters.sourceforge.net) to separate the times of spike events
for each unit. In sleeping birds, we picked the last (second or third) motif,
which gave the strongest response, to analyse the timing of spikes relative to
GTE. This minimized false peaks and troughs in the response profiles. The average
response of each interneuron (1-ms resolution) was smoothed using a
Savitzky-Golay filter (polynomial local regression) and the minima were
identified using a 21-point sliding window.
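
The smoothing and minima-detection step can be sketched as below. This is an illustration under stated assumptions, not the authors' code: the filter length and polynomial order are not given in the text, so the standard 5-point quadratic Savitzky-Golay kernel (−3, 12, 17, 12, −3)/35 is assumed, and a "21-point sliding window" is interpreted here as requiring a sample to be the strict minimum of the 21 samples centred on it. The function names and the test trace are hypothetical.

```python
def savgol5_quadratic(y):
    """Smooth y with the 5-point quadratic Savitzky-Golay kernel.

    The two samples at each edge are left unsmoothed.
    """
    kernel = (-3, 12, 17, 12, -3)  # coefficients, to be divided by 35
    out = list(y)
    for i in range(2, len(y) - 2):
        out[i] = sum(c * y[i + k - 2] for k, c in enumerate(kernel)) / 35
    return out


def local_minima(y, window=21):
    """Indices whose value is the strict minimum of a centred window."""
    half = window // 2
    minima = []
    for i in range(half, len(y) - half):
        segment = y[i - half:i + half + 1]
        if y[i] == min(segment) and segment.count(y[i]) == 1:
            minima.append(i)
    return minima


# Hypothetical response trace at 1-ms resolution with one trough at 30 ms.
trace = [float((i - 30) ** 2) for i in range(61)]
smoothed = savgol5_quadratic(trace)
print(local_minima(smoothed))
```

With a 1-ms trace, the indices returned by `local_minima` are directly the times (in ms) that would then be compared with the GTE times.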


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Subjects, songs and surgeries. All procedures were carried out in accordance
with a protocol approved by the University of Chicago Institutional Animal Care
and Use Committee. Songs were recorded from 12 birds and electrophysiology
was conducted on 9 adult male zebra finches (Taeniopygia guttata) bred in our
colony. Birds were prepared for recordings using standard techniques to implant
a head pin (for auditory experiments) or motorized microdrive (for singing
experiments). For auditory experiments, adults were maintained on a 16 h-8 h
reversed light cycle in sound-isolation boxes. Songs were recorded and filtered
using custom software, and then edited using Praat (http://www.praat.org).
Edited songs included two or three repetitions of one motif, and were
typically 2-4 s in duration. Birds were allowed to recover for 2 or 3 days before
the first day of recordings, and were rested for at least 2 days between recording
sessions.

Electrophysiology, stimulus presentation and spike analysis. HVC extracellular
recordings were carried out using head-fixed sleeping or singing tethered
birds. Recordings were post-processed with a spike-sorting algorithm
(http://klusters.sourceforge.net) to separate the times of spike events for each
unit. For experiments in singing birds, all well-isolated neurons are reported.
For auditory experiments, only BOS-responsive neurons were recorded. The auditory
stimuli were presented randomly with an interstimulus interval of 7 ± 1 s. The
neural response to each song is quantified in terms of the Z score:

$$Z = \frac{\mu_S - \mu_{BG}}{\sqrt{\mathrm{Var}(S) + \mathrm{Var}(BG) - 2\,\mathrm{Covar}(S, BG)}}$$

where $\mu_S$ is the mean response during the auditory stimulus (S) and
$\mu_{BG}$ is the mean response during background activity (BG) (Covar is
covariance, Var is variance). The denominator of the equation is the standard
deviation of (S − BG). The background was estimated by averaging the firing rate
during a 2-s period. The Z scores of the mBOS (model BOS), CON (conspecific song)
and REV (reversed BOS) were normalized to the BOS Z score, and averages across
neurons were reported as means of normalized responses ± s.e.m. For interneurons,
the strength of the response varied across the motifs. To analyse the timing
of spikes relative to GTE, we picked the last (second or third) motif, which gave
the strongest response. This minimized false peaks and troughs in the response
profiles. In singing birds, interneurons fired reliably for each motif and all
motifs were incorporated into the analysis. The average response of each
interneuron (1-ms resolution) was smoothed using a Savitzky-Golay filter
(polynomial local regression) and the minima were identified using a 21-point
sliding window.
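
The Z score defined in this section can be computed as in the following sketch. It assumes that stimulus and background firing rates are paired trial by trial and that sample (n − 1) estimates of variance and covariance are used; the text does not state the normalization, and the function name and numbers are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev, variance


def z_score(stim_rates, background_rates):
    """Z score of paired stimulus (S) and background (BG) firing rates."""
    if len(stim_rates) != len(background_rates):
        raise ValueError("S and BG must be paired trial by trial")
    n = len(stim_rates)
    mu_s, mu_bg = mean(stim_rates), mean(background_rates)
    # Sample covariance of the paired rates.
    covar = sum((s - mu_s) * (b - mu_bg)
                for s, b in zip(stim_rates, background_rates)) / (n - 1)
    denom = sqrt(variance(stim_rates) + variance(background_rates) - 2 * covar)
    return (mu_s - mu_bg) / denom


# Hypothetical firing rates (spikes per second) over four trials.
S = [12.0, 15.0, 11.0, 14.0]
BG = [4.0, 5.0, 5.0, 4.0]
z = z_score(S, BG)
print(z)
```

Because the denominator is the standard deviation of (S − BG), the same value is obtained by dividing the mean of the trial-wise differences by their standard deviation, which gives a quick consistency check.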
Reconstruction of motor gestures. We assumed flow-induced oscillations of
opposing labia as a sound-source model for bird song production. This model
assumes that for airflow values above a certain threshold, the labia start to
oscillate with a wavelike motion. Assuming that two basic modes are active (a
flapping-like motion and a lateral displacement of the tissues, appropriately out
of phase), a system of equations describes the dynamics of the medial position
x(t) of one of the opposing labia, at one of the sound sources. These read:

$$\frac{dx}{dt} = y$$

$$\frac{dy}{dt} = \frac{1}{m}\left(-k(x)\,x - \left(b(y) + c\,x^{2}\right)y + a_{lab}\,p_{av}\right)$$

where the first term in the second equation is the restitution in the labium, the
second term accounts for the dissipation, and the last term for the force due to
the interlabial pressure, where $a_{lab}$ is the labial area. The average
pressure, $p_{av}$, can be written in terms of the displacement and its velocity.
These equations describe a set of qualitatively different dynamical regimes. To
gain independence from the details of any particular model presenting these
regimes, we worked with a normal form that unfolds into a saddle-node in
limit-cycle bifurcation and a Hopf bifurcation. The normal form, which is
analytically derived, constitutes the simplest set of equations for any model in
which oscillations arise in either of these two bifurcations. Once this reduction
is carried out, the selection of parameters that enables a sound with specific
acoustic features to be obtained gives rise to unique values. The normal form
equations are shown in Fig. 1, and display the same set of dynamical regimes as
the physical model, with scaling through a time constant γ. Once x(t) is
computed, the pressure at the input of the tract is computed as
$P_{in}(t) = v(t)\,x(t) - r\,P_{in}(t - T)$, where T is the time for a sound wave
to reach the end of the tube and return, and v(t) is proportional to the mean
velocity of the flow. The transmitted pressure fluctuation
$P_{tr}(t) = (1 - r)\,P_{in}(t - 0.5T)$ forces the air in the glottis, which is
approximated by the neck of a Helmholtz resonator (used to model the OEC); that
is, a large container with a hole, such that the air in its vicinity oscillates
owing to the springiness of the air in the cavity. A linear set of three ordinary
differential equations accounts for the dynamics of the air flow and pressure in
this linear acoustic device, resulting in the final output pressure
$P_{out}(t)$ (Fig. 1).
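
The delayed-reflection relations for the tract input and the transmitted pressure can be written as a discrete-time recursion. The sketch below is illustrative only: it assumes a fixed sample step, a round-trip delay T equal to an even number of samples, zero pressure before the first sample, and a constant coefficient r; the function name and the impulse input are hypothetical, and the Helmholtz resonator stage is not included.

```python
def tract_pressures(x, v, r, delay_samples):
    """Discrete-time sketch of the tube equations.

    p_in[n] = v[n] * x[n] - r * p_in[n - D]
    p_tr[n] = (1 - r) * p_in[n - D // 2]
    where D = delay_samples is the round-trip time T in samples (even).
    Samples before the start of the signal are taken as zero.
    """
    if delay_samples <= 0 or delay_samples % 2:
        raise ValueError("delay_samples must be a positive even integer")
    half = delay_samples // 2
    p_in, p_tr = [], []
    for n in range(len(x)):
        reflected = p_in[n - delay_samples] if n >= delay_samples else 0.0
        p_in.append(v[n] * x[n] - r * reflected)
        p_tr.append((1 - r) * p_in[n - half] if n >= half else 0.0)
    return p_in, p_tr


# Hypothetical impulse input to show the delayed, sign-inverted reflection.
x = [1.0] + [0.0] * 7
v = [1.0] * 8
p_in, p_tr = tract_pressures(x, v, r=0.5, delay_samples=4)
print(p_in)
print(p_tr)
```

An impulse at the source reappears at the tract input one round trip later, scaled by −r, and at the output half a round trip later, scaled by (1 − r).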

We reconstructed the parameters driving the equations of the normal form
(α(t) and β(t)), as well as the parameters describing the tracheal length and
the OEC cavity, in such a way that the synthesized sounds presented the same
fundamental frequencies and spectral content as natural song. Reconstructions
over sequential sound segments gave estimates of the time-dependence of
physiological parameters used during song production. A linear integrator
(τ = 2.5 ms) was used to compute the envelope of the sound signal. A threshold
was used to identify phonating segments. For those longer than 20 ms, we
decomposed the recorded songs into successive 20-ms segments (time between
consecutive segments Δt = 1/20,000 s). These were short enough to avoid large
variation of the physiological gestures, and long enough to compute spectral
content. For each segment, we computed the spectral content index (SCI) and the
fundamental frequency. A search in the parameter space α(t), β(t) was performed
over a grid so that the synthetic sounds produced would match the fundamental
frequencies of the song segment being fitted. Over the set of α(t), β(t) values
selected, a search was carried out so that the SCI of the synthetic sounds
matched the value of the song segment. For sound segments shorter than 20 ms, the
fundamental frequency was computed as follows: first, we selected the relative
maxima of the sound signal that reached the sound envelope; next, the fundamental
frequency was computed as the inverse of the time difference between the next two
consecutive selected maxima; after this, the SCI at that time was estimated as
the average value among all the possible SCI values corresponding to that
frequency in the framework of the model. With those estimations of fundamental
frequency and SCI, α(t) and β(t) were computed. In general, brief segments were
found to be fast trills. We modelled these as rapid oscillations of pressure and
tension, with the amplitude of the pressure oscillations such that the maxima
fell in the phonating region, and with the amplitude of the tension oscillations
such that the frequency range of the vocalization was reproduced. We found that
most of the parameters could be approximated well by fractions of sine functions,
exponential decays, constants or a combination of these.
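
The envelope and phonation-detection steps can be sketched as follows. This is a minimal illustration under assumptions: the "linear integrator" is modelled here as a first-order leaky integrator of the rectified signal with time constant 2.5 ms, the sample step is taken as 1/20,000 s, and the threshold value, function names and test signal are hypothetical.

```python
def envelope(signal, dt, tau=2.5e-3):
    """First-order leaky integrator of the rectified signal."""
    alpha = dt / tau
    env, e = [], 0.0
    for s in signal:
        e += alpha * (abs(s) - e)
        env.append(e)
    return env


def phonating_segments(env, threshold):
    """Return (start, stop) sample-index pairs where env exceeds threshold."""
    segments, start = [], None
    for i, e in enumerate(env):
        if e > threshold and start is None:
            start = i
        elif e <= threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(env)))
    return segments


# Hypothetical signal: silence, a 100-ms full-scale burst, then silence.
dt = 1 / 20000
signal = ([0.0] * 200
          + [1.0 if i % 2 else -1.0 for i in range(2000)]
          + [0.0] * 2000)
segments = phonating_segments(envelope(signal, dt), threshold=0.5)
print([(a * dt, b * dt) for a, b in segments])
```

Segments found this way that exceed 20 ms would then be cut into successive 20-ms windows for the spectral analysis described above.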

Use of these analytic functions as parameters of the model to generate a
synthetic copy of the recorded song resulted in a noiseless surrogate song (for
example, Supplementary Fig. 1 (‘Noise = 0’)). The addition of noise allowed the
gradual recovery of realistic timbre features. The dimensionless variable ‘Noise’
varied between 0 and 40, with Noise = 5 corresponding to a fluctuation size equal
to 2.5% of the maximum range of the β(t) parameter. Note that the effect of
timbre is more important for low-frequency sounds, which explore a small range
of β(t).

For each bird, the length of the trachea was chosen so that the frequencies close 
to 2.5 kHz and 7 kHz in the bird’s song were the first and second resonances of a 
tube closed at one end. This corresponds to a length of 3.5 cm (ref. 45). Typically, 
zebra finch songs have a third important resonance at approximately 4 kHz. The 
parameters of the Helmholtz resonator were adjusted so that its resonant 
frequency would account for this resonance. The synthetic songs for sleeping
birds were generated before the electrophysiological experiments were carried 
out. For singing birds, all song reconstructions were also performed blind to the 
spike data. 
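
The relation between tracheal length and the tube resonances can be checked with the standard quarter-wavelength formula for a tube closed at one end, f_n = (2n − 1)c / (4L). The sketch below assumes a sound speed of 350 m/s, which is not stated in the text; the function name is hypothetical.

```python
def closed_tube_resonances(length_m, n_modes=2, sound_speed=350.0):
    """Resonant frequencies (Hz) of an ideal tube closed at one end."""
    return [(2 * n - 1) * sound_speed / (4.0 * length_m)
            for n in range(1, n_modes + 1)]


# A 3.5-cm tube, as quoted in the text.
resonances = closed_tube_resonances(0.035)
print(resonances)
```

Under these assumptions a 3.5-cm tube has its first two resonances at about 2.5 kHz and 7.5 kHz, close to the values quoted for the birds' songs.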
Intestinal label-retaining cells are secretory precursors expressing Lgr5


Simon J. A. Buczacki¹, Heather Ireland Zecchini¹, Anna M. Nicholson¹, Roslin Russell¹, Louis Vermeulen¹, Richard Kemp¹
& Douglas J. Winton¹


The rapid cell turnover of the intestinal epithelium is achieved from small numbers of stem cells located in the base 
of glandular crypts. These stem cells have been variously described as rapidly cycling or quiescent. A functional 
arrangement of stem cells that reconciles both of these behaviours has so far been difficult to obtain. Alternative 
explanations for quiescent cells have been that they act as a parallel or reserve population that replaces rapidly cycling
stem cells periodically or after injury; their exact nature remains unknown. Here we show mouse intestinal quiescent 
cells to be precursors that are committed to mature into differentiated secretory cells of the Paneth and enteroendocrine 
lineage. However, crucially we find that after intestinal injury they are capable of extensive proliferation and can give 
rise to clones comprising the main epithelial cell types. Thus, quiescent cells can be recalled to the stem-cell state. These 
findings establish quiescent cells as an effective clonogenic reserve and provide a motivation for investigating their role 
in pathologies such as colorectal cancers and intestinal inflammation. 


The epithelial lining of the small intestine is continuously renewed from a small
number of stem cells including Lgr5-expressing crypt base cells that have been
shown to be a rapidly cycling stem-cell population in homeostasis. Alternative
markers of intestinal stem cells have variously identified populations as rapidly
cycling or quiescent, with proposed roles for the latter as a parallel or reserve
stem-cell population that acts to replace rapidly cycling stem cells either
periodically or after injury. Periodic replacement, however, implies a
hierarchical relationship that contrasts with the recent demonstration that the
ongoing loss and replacement of rapidly cycling stem cells explains all the
cellular output of the crypt. Hence, contemporary views consider the possibility
of a reserve stem-cell population that is distinct from the rapidly cycling
population responsible for homeostasis. Quiescent or slowly cycling stem-cell
populations are defined by the property of label retention. Under conditions of
expansion of the stem-cell compartment (during the later stages of gut growth or
during regeneration after epithelial damage), proliferating cells in the
intestine can be labelled with nucleotide analogues. Small numbers of
label-retaining cells (LRCs) persist for around 4 weeks after homeostasis is
(re)established. On subsequent regenerative challenge some LRCs can re-enter the
cell cycle, thereby demonstrating their proliferative potential. Quiescent cells
identified in this way are predominantly located in a supra-Paneth cell position
within the crypt. Several candidate markers or regulators of quiescence with
expression patterns that overlie this location have been identified.
Bmi1-expressing cells in the proximal small intestine are capable of clonal
expansion, have low Ki67 positivity and are responsible for repopulation of the
Lgr5+ stem-cell population after conditional Lgr5 cell deletion. Wip1 phosphatase
regulates stem-cell apoptosis in a p53-dependent manner and co-localizes with
thymidine label-retaining cells in a supra-Paneth cell position. The tumour
suppressor Pten acts to limit the numbers of intestinal stem cells and its
inactive phosphorylated form has been shown to co-localize with
5-bromodeoxyuridine (BrdU)-positive label-retaining cells. It has also been shown
that Hopx marks cells in the +4 position, a small proportion of which are
label-retaining and that are also capable of converting to a Lgr5 population and
maintaining clonogenic growth. Most recently two publications dispute the role of
the negative regulator of ErbB, Lrig1. One proposes that it is a unique marker of
quiescent stem cells distinct from the bulk Lgr5 population, whereas the other
identifies it as a general regulator of the stem-cell compartment. Furthermore, a
recent report has demonstrated that all previously identified ‘quiescent
stem-cell’ markers appear highly expressed by the Lgr5 population. In summary,
the exact nature of quiescent crypt cells remains unknown because candidate
markers rather than the label-retaining population itself have been the focus of
study.


Identification of label-retaining cells 


To study quiescence directly we identified cells retaining nuclear-localized
fluorescent H2B-YFP during a chase period following a pulse of induced
expression. Transgenic Cyp1a1-H2B-YFP mice were evaluated before and after a
pulse of β-naphthoflavone (βNF) treatment (Fig. 1a, b). The Cyp1a1 promoter has
an established pattern of expression on induction in all cells of the
crypt-villus axis with the exception of the mature Paneth cells. Twenty-four
hours after induction (T0) cells throughout the crypt to villus axis expressed
H2B-YFP with, as expected, the exception of Paneth cells (Fig. 1b-d and
Supplementary Video 1). By 7 days crypt expression was restricted to multiple
cells of the crypt base (Supplementary Fig. 1a). These cells decrease in
frequency with time, becoming absent by 12 weeks (Supplementary Fig. 1b-d).
Confocal microscopy analysis after 1 week suggested that a number of positive
cells were Paneth cells due to characteristic enlarged and rounded nuclear
morphology; this was confirmed using Paneth cell markers (Supplementary
Fig. 1e, f). Non-Paneth LRCs were selected as the population of interest and,
from 10 days post-induction, were defined as YFP-labelled LRCs (YFP-LRCs) in
accord with published definitions of intestinal LRCs (Fig. 1e). YFP-LRCs were
present throughout the crypt base, slightly predominating in the +3 position, and
were identifiable in decreasing numbers for up to 4 weeks (Fig. 1f, g).
Subsequently, YFP Paneth cells were seen up to 8 weeks after this time (that is,
12 weeks post-induction) but not thereafter, which accords with the previously
reported lifespan of terminally differentiated Paneth cells of 6-8 weeks. By
3 weeks YFP-LRCs were reduced to 0.4 ± 0.13 per crypt section, approximating to
2 per whole crypt. No YFP-LRCs were seen in the colon.


Figure 1 | Identification and isolation of YFP-LRCs. a–f, Time course
images of intestinal epithelium from Cyp1a1-H2B-YFP mice after induction
with βNF. a, Uninduced control. A very small number of background YFP+
cells are present on duodenal villus tips. b, c, T0. d, T0 three-dimensional
reconstructed z-stack image of an isolated crypt showing that Paneth cells
identified with UEA (red) do not initially express YFP (yellow). e, Three weeks
post-induction image demonstrating a typically positioned YFP+ cell above
UEA+ (red) Paneth cells. f, Non-Paneth YFP+ LRCs are distributed

The above estimate for YFP-LRC number is 50 times higher than previously
described for LRCs identified by retention of nucleotide analogues (0.008 per
crypt section). Confocal microscopy on dual pulse-chased EdU/H2B-YFP mice
determined that EdU-LRCs were recognized at a frequency of 0.009 per crypt
section and that EdU-LRCs (27 out of 30) are YFP+, demonstrating that the
majority (90%) of EdU-LRCs can be identified by YFP label retention (Fig. 1h).
The greater abundance of YFP-LRCs (2-3 per crypt) demonstrates that nucleotide
labelling incompletely marks slowly cycling populations, presumably because the
method only marks cells captured in S phase before acquiring quiescence.
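
The two figures quoted in this paragraph follow from simple ratios, shown here as a short check. All inputs are values stated in the text; the variable names are illustrative only.

```python
yfp_lrc_per_section = 0.4           # YFP-LRCs per crypt section at 3 weeks
nucleotide_lrc_per_section = 0.008  # previously described LRC frequency
fold_difference = yfp_lrc_per_section / nucleotide_lrc_per_section

edu_lrcs_scored = 30                # EdU-LRCs examined
edu_lrcs_yfp_positive = 27          # of which were also YFP+
yfp_positive_fraction = edu_lrcs_yfp_positive / edu_lrcs_scored

print(round(fold_difference), f"{yfp_positive_fraction:.0%}")
```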


LRCs have a combined secretory and stem-cell signature 


After tissue dissociation, YFP-LRCs were isolated by flow sorting for
transcriptional profiling. A strategy combining UEA lectin (for Paneth cell
selection) and CD24 (lower crypt specific) was used to compare three populations
independent of stem markers associating with cell cycle status (Fig. 1i, j and
Supplementary Figs 1f, g and 2a, b). Expression microarray analysis showed that
YFP-LRCs (CD24+UEA−YFP+) are distinct from both Paneth (CD24+UEA+) and cycling
lower crypt cells (LCCs) (CD24+UEA−YFP−) (Supplementary Fig. 2c). Notably,
transcripts associated with the Paneth and enteroendocrine lineages (Mmp7, Kit,
Chga, Gip, Pax6) as well as stem-cell markers (Lgr5, Lrig1, CD133, CD44, Peg3)
were more abundant in YFP-LRCs compared to LCCs. To establish that the YFP-LRC
signature did not arise as a composite of the other two differentiated cell
populations, we undertook single-cell profiling of 48 cells from each of the
three groups for 47 transcripts selected either as stem cell/differentiation
markers or YFP-LRC-specific from the array comparison (see Methods). Principal
component analysis demonstrated that YFP-LRCs were a discrete homogeneous
population separate from both Paneth cells and LCCs but with no internal
structure (Fig. 2a). Nfatc3, CD83 and Nfat5 were the top three principal
components defining YFP-LRCs. It was confirmed that larger numbers of YFP-LRCs
expressed higher levels of Paneth/enteroendocrine cell markers than LCCs,
including Mmp7 and Chgb (Fig. 2b). Homogeneity of the YFP-LRC population was
further validated using immunofluorescence for differentiation markers on
flow-sorted populations (Supplementary Fig. 3).

Unexpectedly, given the quiescent status of YFP-LRCs, the microarray showed
elevated expression of Lgr5 in YFP-LRCs compared to the LCCs. To determine the
degree of similarity with the Lgr5 population, we used previously published
microarray data and found a highly significant degree of overlap in overexpressed
genes between YFP-LRCs and Lgr5-GFP+ cells (P < 0.0001) (Fig. 2c). Furthermore,
quantitative polymerase chain reaction with reverse transcription (qRT-PCR)
comparison of YFP-LRCs with Lgr5-GFP+ cells isolated from Lgr5-EGFP-IRES-creERT2
mice demonstrated equivalent expression not only of Lgr5 but also of ‘quiescent’
stem-cell markers Tert, Lrig1 and Hopx. In addition, there was upregulated
expression of Paneth and enteroendocrine lineage genes in YFP-LRCs compared to
Lgr5-GFP+ cells (Fig. 2d). These comparisons suggest that most YFP-LRCs are a
subpopulation of Lgr5-expressing cells. This was confirmed using spectral
detection confocal microscopy to overlay YFP and GFP fluorescence on
Cyp1a1-H2B-YFP/Lgr5-EGFP-IRES-creERT2 mice, which showed that 98% (40 out of 41)
of YFP-LRCs also expressed Lgr5 (Fig. 2e).


Elevated Lgr5 defines LRC maturation into Paneth cells 


YFP-LRCs have a combined secretory and stem-cell signature,
indicating that they may have a secretory fate. Confocal microscopy
was first used to track YFP inheritance into the Paneth cell
population from 10 days (when all YFP-LRCs are formally defined) to
21 days post-induction (Fig. 3a). YFP-LRCs decrease in number over
time whereas the number of YFP+ Paneth cells increases (Fig. 3b).
This reciprocal relationship shows that one fate of YFP-LRCs is
differentiation to Paneth cells. Moreover, the increasing number of
YFP+ Paneth cells over the 11-day chase accounts for 3.6 new Paneth
cells per crypt (see Methods). This chase period is relatively short
compared to the turnover time of Paneth cells (42 days), indicating
that the direct conversion of LRCs to Paneth cells without any cell
division could create around 14 (3.6 × 42/11) Paneth cells per crypt
over this time. With an estimated 15 Paneth cells per crypt it is
likely that most, if not all, Paneth cells originate via LRCs.

Figure 2 | YFP-LRCs are a discrete Lgr5-expressing subpopulation.
a, Principal component analysis (PCA) on single-cell Fluidigm qRT-PCR
data from 48 single YFP-LRCs (red), LCCs (green) and CD24+UEA+ (blue)
cells. b, Summary table of single-cell data showing the number of
cells expressing secretory cell transcripts (n = 48) and the log fold
change expression level of expressing cells (Av FE) relative to LCCs.
c, Venn diagram demonstrating the overlap in overexpressed genes
between YFP-LRCs and Lgr5+ cells. d, qRT-PCR data comparing the
expression patterns of YFP-LRCs and Lgr5+ cells. Data are
mean ± s.e.m. ***P < 0.005. Two-tailed Student’s t-test.
e, Representative confocal microscopy image of an Lgr5-GFP+/YFP-LRC
cell. YFP, red; Lgr5, green. Scale bar, 10 µm.
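
The estimate of around 14 Paneth cells per crypt can be reproduced with a few lines of arithmetic. The Python sketch below is illustrative only: the function and argument names are hypothetical, and it assumes, as the text does, a constant rate of LRC-to-Paneth conversion with no cell division.

```python
def paneth_cells_from_lrcs(new_per_chase, chase_days, turnover_days):
    """Scale the Paneth cells gained during the chase to one turnover period."""
    return new_per_chase * turnover_days / chase_days


# Values quoted in the text: 3.6 new Paneth cells per crypt over an 11-day
# chase, and a Paneth cell turnover time of 42 days.
estimate = paneth_cells_from_lrcs(3.6, 11, 42)
print(round(estimate, 1))  # 13.7, i.e. "around 14" per crypt
```

Compared with the estimated 15 Paneth cells per crypt, this figure is what supports the statement that most, if not all, Paneth cells could arise via LRCs.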

We next investigated whether a subset of Lgr5+ cells with maturing
Paneth cell features, further advanced in Paneth cell status than
YFP-LRCs, could be identified independently of the label-retaining
marker. A subset of Lgr5-GFPhi cells that stained with UEA as a
marker of Paneth cells was identified, isolated by flow sorting and
confirmed to be enriched for the Paneth cell transcripts Mmp7 and
Defa5 (Fig. 3c–f).

To determine alternative fates for YFP-LRCs, the upper crypt region
was analysed at 14 days post-induction for the presence of YFP+ cells
transiting from the crypt base. Such cells are infrequent owing to
the rapid rate of migration through the crypt (Fig. 3g). The vast
majority of YFP+ cells (24 out of 25) present above the crypt base
were chromogranin A+ enteroendocrine cells. This proportion
effectively eliminates the numerically more abundant absorptive or
goblet cells as arising from YFP-LRCs. Overall, LRCs appear to be a
Paneth cell precursor population but are also capable of some
enteroendocrine differentiation.


LRCs possess multi-lineage growth potential 

To determine whether YFP-LRCs are irreversibly committed to a
secretory fate we functionally assessed their growth potential in
organoid culture. YFP-LRCs and Lgr5-GFPhi cells plated at the same
densities were roughly equivalent in ability to form organoids
(0.15% versus 0.25%). The organoids from YFP-LRCs demonstrated crypt
budding at around 7 days and were found to resemble intact
crypt-villus units as previously described (Supplementary Fig. 4).
YFP-LRCs are therefore capable in a regenerative setting of acquiring
stem-cell characteristics.
Figure 3 | Acquisition of Paneth and enteroendocrine cell
characteristics from YFP-LRCs. a, Image demonstrating that after
4 weeks all YFP+ cells are Paneth cells as confirmed by lysozyme
staining. b, Assessment of the number of YFP+ cells between 10 and
21 days demonstrates a reciprocal relationship in cell types: YFP+
Paneth cells (YFP-PC) increase in frequency as YFP-LRCs decrease
(mean ± s.e.m.); n = 100. ***P < 0.001. Mann-Whitney U-test.
c, FACS analysis showing that around 15% of Lgr5-GFPhi cells are
marked by UEA. d, Image-stream images showing examples of
Lgr5-GFPhi UEA− and Lgr5-GFPhi UEA+ cells from c. Note the greater
intensity of GFP in the latter. e, Confocal image showing a GFP+UEA+
cell. Left panel: UEA (red), Lgr5-GFP (green) and DAPI (blue). Middle
panel: Lgr5-GFP and DAPI. Right panel: Lgr5-GFP alone. f, RT-PCR data
showing upregulation of Paneth cell transcripts Mmp7 and Defa5 in the
GFP+UEA+ subpopulation (mean ± s.e.m.); *P < 0.05; **P < 0.01.
Two-tailed Student’s t-test. NS, not significant. g, Representative
image of rare YFP+ chromogranin A+ enteroendocrine cells in the upper
crypt at 2 weeks post-induction. Scale bars: 10 µm (a, e, g).


LRCs contribute to the stem-cell pool only after injury 


To functionally assess whether YFP-LRCs have stem-like
characteristics in vivo we used a novel dimerizable Cre recombinase
strategy (Fig. 4). dicreAB mice are capable of clonally marking cells
when two inactive Cre fragments heterodimerize to form functional
recombinase by the binding of Ariad rapamycin analogues to
FKBP-ligand-binding domains present in both fragments. The
carboxy-terminal peptide of Cre (CreB) is expressed constitutively
from the Rosa26 locus whereas the amino-terminal peptide (CreA) is
expressed as an H2B-CreA fusion protein under the control of the
BNF-inducible promoter, Cyp1a1 (Fig. 4a, b). Treatment of double
transgenic Rosa26-creB and Cyp1a1-H2B-creA mice with BNF induces
expression of H2B-creA throughout the epithelium on a background of
constitutive creB expression. With time after BNF induction only LRCs
retain H2B-CreA, in a manner analogous to H2B-YFP. Administration of
intravenous dimerizing agent causes the two fragments of Cre to form
a functional protein with subsequent downstream reporter expression.

Figure 4 | LRCs only demonstrate clonogenicity after injury in vivo.
a, Diagrammatic representation of diCreAB construct showing position
of FKBP dimerizing domain in protein fusions to both N terminus
(H2B-creA) and C terminus (creB) regions of a split cre gene.
b, Representation of the in vivo dimerization of CreA and CreB to
form a functional Cre recombinase. Dimerizing agent (AP20187) (green)
mediates dimerization of two FKBP domains thereby creating a
functional Cre complex. c, Generic schematic of induction protocol to
test in vivo clonogenicity of cells expressing creA. BNF induces
H2B-creA against a background of constitutive creB expression. With
time (tx) following BNF administration H2B-creA is first expressed
and then becomes restricted. Expressing cells are tested for their
ability to generate clones after treatment with dimerizing agent
(AP20187) that allows a functional Cre complex to form and thereby
mediate recombination of a cre reporter gene. d, Schematic showing
conditions for clone formation. AP20187 is sufficient for clone
formation 0–2 days post BNF (top row) but not between 3–13 days
(middle row). Between 6–13 days combined injury and AP20187 are
required for clone formation (bottom row). In the clone shown, the
injury was hydroxyurea. e, Numbers of clones formed over the total
murine intestinal tract in the absence of injury (blue) and in the
presence of injury (red), showing that treatment with AP20187 only
generates clones in the absence of injury up to 2 days after BNF
treatment. When AP20187 treatment is combined with injury, clones are
generated up to 14 days post BNF induction of H2B-creA (squares,
hydroxyurea; stars, irradiation; triangles, doxorubicin). f, Clone
width (measured in cell diameters) comparisons from data acquired at
3 weeks after induction of clones with BNF in AhcreERT mice (n = 277)
and 3 weeks after dimerization of Cre with AP20187 in dicreAB mice
(n = 23) induced with BNF and then dimerized and injured 13 days
later.

By varying the interval between BNF induction and Cre dimerization
we show that, in the absence of injury, crypt cells form clones only
when up to 3 days separate BNF induction and dimerization
(Fig. 4c, d). With longer induction–dimerization intervals (>3 days)
clone formation is not observed. However, induction–dimerization
intervals of up to 14 days induced clone formation when combined with
intestinal injury (Fig. 4e). These clones were noted to contain all
differentiated cell lineages at appropriate frequency (Supplementary
Fig. 5). Comparison of clone sizes with our previously published data
for pulse-chase clone induction after tamoxifen treatment shows that
LRC-derived clones follow the size distribution expected for clones
of equivalent age (3 weeks). This distribution arises from clone
expansion by neutral drift, which leads to clone fixation as clones
come to occupy whole crypts (Fig. 4f).

These results demonstrate that rapidly cycling stem cells (not
having diluted CreA within 3 days) are, as expected, competent to
produce clonal progeny in the steady state. The lack of clone
formation with longer induction–dimerization intervals could reflect
either insufficient bioavailability of H2B-CreA for recombination or
a lack of clonogenicity in H2B-CreA-retaining cells. The injury
models indicate the latter, demonstrating that LRCs do not maintain
the stem-cell pool during homeostasis. Under conditions of
injury/regeneration LRCs are capable of clonogenic growth by recall
to the self-renewing pool of stem cells. To our knowledge this is the
first demonstration of in vivo lineage tracing based on label
retention in a mammalian system.


Mature Paneth cells are not proliferative 


The differentiation of YFP-LRCs into Paneth cells has the
consequence that both cell types co-exist. To confirm that Paneth
cells are indeed terminally differentiated and not able to
proliferate after damage, H2B-YFP mice were induced with BNF in a
pulse-chase experiment with a chase period (5 weeks) that extended
beyond the lifespan of YFP-LRCs. These animals were then irradiated
and the response of the residual population of YFP+ Paneth cells
determined. In control (non-irradiated) animals YFP+ cells were
invariably isolated Paneth cells. After radiation treatment (at 1, 2
and 7 days), intestinal sections were examined for the doublets or
strings of YFP+ cells that would arise if Paneth cells were
undergoing post-irradiation division. The pattern of YFP positivity
remained unchanged from the controls, suggesting that Paneth cells do
not divide (Supplementary Fig. 6).


Discussion 


There are a number of alternative functional roles for a quiescent
population. First, they could be a stem population co-existing and
in dynamic equilibrium with the rapidly cycling cells. The dynamics
of stem-cell renewal, however, have shown that the turnover of a
single population of rapidly dividing cells is sufficient to drive
changes in clone size distribution, leading to clones that occupy
whole crypts. Second, they could be a dormant or reserve stem-cell
population that is only active under conditions of epithelial stress
or injury. Third, they could be terminally differentiated Paneth
cells. Last, and not previously described, they could be committed to
differentiate and no longer contribute to stem-cell functions.

Our observations establish the functional role of LRCs. They are
normally destined to become Paneth and enteroendocrine cells but
retain the ability to reacquire stem-cell function and can be
recruited to serve as a functional clonogenic stem population under
conditions of regeneration. Historically it is accepted that
following stem-cell ablation cells normally destined for
differentiation can regenerate the stem-cell compartment. However,
the current findings demonstrate directly that a committed precursor
can be recalled to the stem-cell pool after damage in a similar
manner to that shown recently for Dll1+ secretory precursors.

Our observations resolve the apparent paradox that quiescence
markers are expressed within a population of Lgr5+ cells that are
viewed as rapidly cycling. Around 20% of Lgr5+ cells are largely
quiescent and continue to express Lgr5 before Paneth cell maturation.

Lgr5+ cells have been implicated as cancer stem cells in colonic
cancers, and higher frequencies of Lgr5+ cells in intestinal cancers
correlate with more extensive Paneth cell differentiation. Paneth
cells may nurture cancer stem cells by niche generation as they do
normal stem cells in culture. It will be important to determine the
extent to which quiescent secretory intermediates retaining
clonogenic potential are maintained in colon cancers and to establish
the factors that activate them.


METHODS SUMMARY 


Mouse models and inductions. Cyp1a1-H2B-YFP mice have previously been
described. dicreAB mice were generated as follows: creA and creB are
two segments of the cre coding sequence comprising codons 19-59 and
60-343, respectively, and were fused to other protein motifs with
flexible F2 linkers as described previously (see Methods for
details).

Cyp1a1 was induced by 3× intraperitoneal BNF injections over 24 h
(80 mg kg⁻¹). Dimerization of Cre fragments was performed by 2×
intravenous AP20187 (Ariad) injections (10 mg kg⁻¹).

Single-cell qRT-PCR. FACS-sorted single cells underwent single-cell
RNA amplification (see Methods for details). Generated cDNA libraries
were then used for Fluidigm Biomark high-throughput qRT-PCR against
custom-designed and validated primers (see Methods for details).
Principal component analysis was carried out in R.

Quantitative PCR. qRT-PCR was performed using standard TaqMan or
Sybr green assays. Custom primers were validated before use. For
experiments where cell numbers were limiting, cDNA was amplified
using a custom protocol (see Methods for details). Fluidigm qRT-PCR
was performed as per the manufacturer’s protocol.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Mouse models. Cyp1a1-H2B-YFP mice have previously been described.
Briefly, Cyp1a1-H2B-YFP mice were created using conventional cloning
techniques involving removing the human H2B coding sequence from the
pBOS-H2BGFP vector (BD Pharmingen) and then ligating this into
pIRES-eYFP (BD Biosciences). Lgr5-EGFP-IRES-creERT2 mice were
purchased from the Jackson Laboratory.

dicreAB mice were generated as follows: creA and creB are two
segments of the cre coding sequence comprising codons 19-59 and
60-343, respectively, and were fused to other protein motifs with
flexible F2 linkers as described previously. The FKBP dimerizing
domain and AP20187 were a gift from Ariad Pharmaceuticals. creA was
synthesized as a single fusion gene comprising in 5’ to 3’ order
H2B-F2 linker-FKBP-F2 linker-creA-pA with flanking BglII sites to
permit excision.

H2B-F2 linker-FKBP-F2 linker-creA-pA was subcloned using BglII into
pAh1R1 such that the fusion gene was under the control of 12 kb of
the rat CYP1A1 promoter element. The cassette comprising
Cyp1a1-H2B-F2 linker-FKBP-F2 linker-creA-pA (or
Cyp1a1-H2B-LBD-creA) was excised from the plasmid backbone using
NotI and this fragment purified for pronuclear injection. Oocyte
injection was performed by the Transgenic Lab, within the CRI.

CreB was synthesized as FKBP-F2 linker-creB-pA with flanking NheI
sites for excision (sequences available on request). This cassette
was cloned in the NheI site of ROSmcs13 as described previously. The
linearized plasmid was introduced into ES cells and selected with
puromycin. The S6B6 ES cell line was derived from 129/sv/C57BL6/j
hybrid embryos. Initial screening of ES-derived colonies was
confirmed by long-range PCR and sequencing and mice were generated
as previously described.

Mice were bred and housed according to UK Home Office guidelines. For
induction of Cyp1a1-H2B-YFP mice, 3 intraperitoneal injections of
80 mg kg⁻¹ of BNF over a 36-h period were used. T0 was defined as the
same day as the last BNF injection. For inductions of dicreAB mice,
5 daily intraperitoneal injections of 80 mg kg⁻¹ BNF were used.
Dimerization of Cre was performed using two intravenous injections of
AP20187 at 10 mg kg⁻¹ (Ariad Pharmaceuticals) over 2 days. Injury
models involved single doses of hydroxyurea (intraperitoneally,
1 g kg⁻¹) or doxorubicin (intraperitoneally, 10 mg kg⁻¹), which
target DNA-replicating cells (by inhibition of nucleotide
incorporation and topoisomerase, respectively), or γ-irradiation
(6 Gy), which preferentially kills rapidly cycling cells as they
attempt to replicate damaged DNA template.
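
Because the doses above are quoted per kilogram of body weight, the amount given to an individual animal depends on its weight. The Python sketch below shows the conversion; the 25 g body weight and the function name are assumptions made for illustration, since animal weights are not stated in the text.

```python
def dose_per_mouse_mg(dose_mg_per_kg, body_weight_g):
    """Convert a mg/kg dose into the mass (mg) given to one mouse."""
    return dose_mg_per_kg * body_weight_g / 1000.0


BODY_WEIGHT_G = 25  # hypothetical adult mouse; not specified in the text

bnf_mg = dose_per_mouse_mg(80, BODY_WEIGHT_G)            # BNF, 80 mg/kg
ap20187_mg = dose_per_mouse_mg(10, BODY_WEIGHT_G)        # AP20187, 10 mg/kg
hydroxyurea_mg = dose_per_mouse_mg(1000, BODY_WEIGHT_G)  # hydroxyurea, 1 g/kg

print(bnf_mg, ap20187_mg, hydroxyurea_mg)
```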
Tissue preparation, immunofluorescence and whole-mounting. Preparation of
tissues for immunofluorescence was performed by fixation in 4% PFA
for 48 h followed by 20% sucrose for a further 48 h. Tissue was then
embedded in OCT and sectioned at 6 µm. Antigen retrieval was
performed using 1% SDS for 5 min and then blocked with 5% NDS for
30 min. Sections were stained overnight at 4 °C for all primary
antibodies and 2 h at room temperature for UEA. Secondary antibodies
were applied for 1 h at room temperature. Nuclear staining was
achieved using ProLong Gold antifade reagent with DAPI (Invitrogen).
Sections were visualized on a C1Si Nikon confocal microscope.

For UEA and lysozyme co-expression experiments, primary
anti-lysozyme was incubated overnight at 4 °C followed by incubation
with UEA-AF647 and an AF555-anti-rabbit secondary for 2 h at room
temperature. Differentiated Paneth cells were seen to overlay with
both markers but in addition UEA marked a subset of goblet cells in
both the upper crypt and villus.

Whole-mounting of intestines and detection of SYNBGlu activity was
carried out as previously described. Organoids were prepared for
immunofluorescence as previously reported. Clone sizes were measured
as previously described. Comparisons were made between diCreAB clones
at 3 weeks and those analysed in this publication at the same time.
Confocal counting experiments. For assessment of YFP-LRC position, YFP- 
LRC number and YFP-Paneth cell number, 100 crypts were counted for each 
time point in the proximal small intestine. Position +1 was defined as the apical 
two cells in the crypt and subsequent positions followed superiorly. Illuminating 
laser intensity was set uniformly between all comparisons to allow for repro- 
ducible and comparable detection of YFP positivity. Paneth cells were identified 
based on cytoplasmic and/or membranous UEA positivity. Countable crypts were 
initially detected by DAPI confirmation of a complete half crypt. The crypt of 
interest was then assessed for UEA or YFP positivity up to position +6. Label- 
retaining index was calculated as the mean number of YFP-LRCs at crypt posi- 
tions +1 to +6 2 weeks after BNF induction. 
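
The label-retaining index defined above is a per-crypt mean restricted to positions +1 to +6. A minimal Python sketch of that calculation follows; the data layout (one dictionary of position-to-count per crypt) and the example counts are hypothetical, chosen only to show how cells above position +6 are excluded.

```python
from statistics import mean


def label_retaining_index(crypt_counts, max_position=6):
    """Mean number of YFP-LRCs per crypt at positions +1..+max_position.

    crypt_counts: one dict per crypt mapping crypt position -> YFP-LRC count.
    """
    per_crypt = [
        sum(n for pos, n in crypt.items() if 1 <= pos <= max_position)
        for crypt in crypt_counts
    ]
    return mean(per_crypt)


# Hypothetical counts for four crypts (the study counted 100 per time point).
# The cell at position 7 in the third crypt is outside the scored region.
example = [{1: 0, 3: 1}, {2: 1, 4: 1}, {5: 1, 7: 1}, {}]
index = label_retaining_index(example)
```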

EdU and YFP dual labelling experiments were performed by giving mice
three intraperitoneal injections of EdU (1 mg per mouse) over 24 h
followed by three intraperitoneal injections of BNF (80 mg kg⁻¹) over
the next 24 h. Mice were killed 14 days later and EdU visualized
using the Invitrogen Click-iT imaging detection kit as per the
manufacturer’s protocol.


The murine crypt has a population of 250 cells. Confocal assessment
of an optimally sectioned 2 µm optical section of a small intestinal
crypt shows 39.3 ± 1.1 cellular nuclei. A single section through a
crypt is therefore representative of 16% of the total number of crypt
cells.
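
The 16% figure follows directly from the two numbers quoted above. The Python sketch below reproduces it and, as an assumption about how the fraction might be used, shows how a count made in one optical section could be scaled to a whole crypt; the helper name is hypothetical.

```python
CRYPT_CELLS = 250          # cells per murine crypt, as quoted in the text
NUCLEI_PER_SECTION = 39.3  # mean nuclei in one optical section (text value)

section_fraction = NUCLEI_PER_SECTION / CRYPT_CELLS
print(f"{section_fraction:.1%}")  # 15.7%, reported in the text as 16%


def scale_to_whole_crypt(count_in_section, fraction=section_fraction):
    """Extrapolate a per-section count to the whole crypt (illustrative)."""
    return count_in_section / fraction
```

For example, one cell seen per section would correspond to roughly six cells per crypt under this assumption.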

Images were acquired using a C1Si Nikon confocal microscope. Images
were edited on EZ-C1 3.20 FreeViewer and Adobe Photoshop CS3.
Volocity software was used to generate the H2B-YFP crypt
three-dimensional reconstruction and video.
Flow cytometry. Intestinal epithelial single-cell preparation was performed as
previously described. Briefly, after dissection intestines were
everted and fed onto either 3-mm (distal SI) or 4-mm (proximal SI)
diameter glass rod spirals. They were then incubated in 50 ml of HBSS
without Ca²⁺ and Mg²⁺ supplemented with 10 µM EDTA and 10 mM NaOH at
37 °C for 1 h. Every 10 min the spirals were pulsed using a vibrating
stirrer (Chemap AG, model CH-8604) and the HBSS collected and
replaced with fresh HBSS. Fractions 2-6 were pooled and then
re-suspended in 1× 0.05% Trypsin-EDTA at 37 °C for 7 min with regular
agitation. Epithelial samples were filtered through a 70 µm mesh into
cold 2% FBS/PBS and washed twice more.

After staining with antibodies and UEA, samples were sorted or
analysed using either a BD FACS Aria SORP or BD LSR II, respectively.
Single stained and unstained controls were always used, and propidium
iodide (2.5 µg ml⁻¹) was used to discriminate between live and dead
cells.

For flow cytometric assessment of the percentage of YFP-LRCs, the
denominator was defined as the number of single, live cells from
analysis of 50,000 cells of pooled fractions 2-6 of a single-cell
epithelial preparation; that is, crypt enriched. The numerator was
defined as the number of CD24+ UEA− YFP+ cells; that is,
non-Paneth-cell, crypt-located YFP-LRCs from the same population.
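
The percentage defined above is a simple ratio of gated event counts. The Python sketch below states it explicitly; the function name and the example counts are hypothetical and are not data from the study.

```python
def percent_yfp_lrc(n_cd24pos_ueaneg_yfppos, n_single_live):
    """Percentage of single, live cells that are CD24+ UEA- YFP+."""
    if n_single_live <= 0:
        raise ValueError("denominator must be a positive cell count")
    return 100.0 * n_cd24pos_ueaneg_yfppos / n_single_live


# Hypothetical gate counts from one 50,000-event acquisition.
example_percent = percent_yfp_lrc(120, 40_000)
print(example_percent)  # 0.3
```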

Single-cell fluorescent images were acquired using an Amnis Imagestream. 

FACS data were analysed using FlowJo v7.5.5 (TreeStar). 
Affymetrix exon microarray and analysis. Six age- and sex-matched Cyp1a1- 
H2B-YFP mice 10 days after BNF induction were used comparing three cell 
populations from each mouse. After flow sorting 30,000 cells from the respective 
populations, RNA extraction was performed using the RNeasy Micro kit (Qiagen) 
with on column DNase digestion. RNA quality was assessed using a 2100 
Bioanalyzer (Agilent Technologies). 

RNA amplification and hybridization was performed at the Paterson Institute, 
Manchester, UK, Microarray Facility using Nugen Ovation Pico WTA System for 
amplification and then hybridization to a Mouse Exon 1.0 ST Array. Arrays were 
scanned using an Affymetrix GeneChip scanner 3000 running GCOS software. 

The raw data were RMA (robust multi-array average) normalized using
the Affymetrix Power Tools (APT) software package and the analysis
was restricted to ‘core’ probe sets, which are supported by RefSeq
annotations. The data set was filtered on both the cross-hybridizing
and undetected probe sets and further processing was carried out in R
using a number of Bioconductor packages. Differential expression
analysis was carried out in limma and the hypergeometric test was
used to test for significant overlap with Lgr5+ cells. The
Benjamini-Hochberg method for multiple-testing correction was applied
to P values from both methods. The data were deposited in the GEO
database (accession number GSE43772).
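
The two statistical steps named above can be sketched in a few lines. The study's analysis was carried out in R; the Python below is only an illustration of the underlying calculations, and the function names and gene counts are hypothetical. It computes the upper-tail hypergeometric probability for a gene-list overlap and Benjamini-Hochberg adjusted P values.

```python
from math import comb


def hypergeom_overlap_p(universe, set_a, set_b, overlap):
    """P(X >= overlap) when set_b genes are drawn without replacement
    from a universe that contains set_a genes of interest."""
    upper = min(set_a, set_b)
    tail = sum(
        comb(set_a, k) * comb(universe - set_a, set_b - k)
        for k in range(overlap, upper + 1)
    )
    return tail / comb(universe, set_b)


def benjamini_hochberg(p_values):
    """Return BH-adjusted P values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest P value down
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy numbers: 10 genes in total, lists of 4 and 3 genes sharing 2.
p_overlap = hypergeom_overlap_p(universe=10, set_a=4, set_b=3, overlap=2)
adj = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

With real gene lists, `universe` would be the number of probe sets retained after filtering, which is not stated here.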
Organoid culture. Culture of organoids was performed as previously described,
including the use of Wnt3A (R&D) (100 ng ml⁻¹) for the first 3 days
after plating. Cells were plated in 24-well plates at a density of
~50,000 cells in 50 µl matrigel. Organoids were counted as budding
organoids at 10 days post seeding.
Single-cell expression profiling. For single-cell expression profiling, single cells
were flow-sorted into 96-well PCR plates (Star Lab) with 4 µl of
first-strand synthesis buffer in each well containing 1 µl 5×
SuperScript III buffer (Invitrogen), 0.5 µl 5% NP-40 (Pierce),
0.25 µl 1 mM dNTP mix (Thermo Scientific), 0.075 µl 1 µM MO4d(T)
primer (DNA Technology A/S), 0.05 µl 0.1 M DTT (Invitrogen), 0.25 µl
SuperRNaseIN (Ambion), 0.25 µl RNase OUT (Invitrogen), 1 µl spike
Arabidopsis DNase-treated total RNA 10 'pg µl⁻¹ (gift from
A. Giakountis) and 0.625 µl nuclease-free H₂O (Ambion); and then
incubated at 65 °C for 5 min followed by 45 °C for 2 min. Then 0.5 µl
of SuperScript III (Invitrogen) was added to each aliquot and
incubated at 45 °C for 15 min and then 65 °C for 10 min.
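
The component volumes listed for the first-strand synthesis buffer can be checked against the stated 4 µl per well. The Python sketch below does this and scales the recipe to a plate; the `master_mix` helper and its 10% pipetting excess are assumptions for illustration, not part of the published protocol.

```python
# Per-well volumes (µl) of the first-strand synthesis buffer, as listed.
FIRST_STRAND_MIX_UL = {
    "5x SuperScript III buffer": 1.0,
    "5% NP-40": 0.5,
    "1 mM dNTP mix": 0.25,
    "1 uM MO4d(T) primer": 0.075,
    "0.1 M DTT": 0.05,
    "SuperRNaseIN": 0.25,
    "RNase OUT": 0.25,
    "spike Arabidopsis total RNA": 1.0,
    "nuclease-free water": 0.625,
}

total_ul = sum(FIRST_STRAND_MIX_UL.values())
assert abs(total_ul - 4.0) < 1e-9  # matches the 4 µl dispensed per well


def master_mix(recipe, n_wells, excess=0.1):
    """Scale a per-well recipe to n_wells with a fractional excess."""
    factor = n_wells * (1 + excess)
    return {name: round(vol * factor, 3) for name, vol in recipe.items()}


plate_mix = master_mix(FIRST_STRAND_MIX_UL, n_wells=96)
```

The same check applies to the later mixes, for example the 1 µl exonuclease mix (0.2 + 0.6 + 0.2 µl).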

One microlitre of exonuclease mix was then added containing 0.2 µl
exonuclease I (Thermo Scientific), 0.6 µl 75 mM MgCl₂ (Ambion) and
0.2 µl nuclease-free H₂O (Ambion) and incubated at 37 °C for 30 min
and 80 °C for 25 min.

Seven microlitres of a reaction mix was then added containing 2.6 µl
5× TdT buffer (Promega), 0.097 µl 100 mM dATP (Thermo Scientific),
0.5 µl RNase H (Invitrogen), 0.5 µl TdT (Promega) and 3.303 µl
nuclease-free H₂O (Ambion) and incubated at 37 °C for 15 min and
70 °C for 10 min.

Sixteen microlitres of amplification mix 1 was then added to 4 µl of
polyadenylated cDNA in triplicate, containing 2 µl 10× ExTaq buffer
(Takara), 0.13 µl 100 mM dATP/dGTP/dCTP/dTTP (Thermo Scientific),
1.65 µl 100 µM MO4d(T) primer (DNA Technology A/S), 0.2 µl ExTaq
polymerase (Takara) and 11.63 µl nuclease-free H₂O (Ambion) and
cycled at 94 °C for 1 min, 50 °C for 2 min and 72 °C for 2 min
followed by 35 cycles of 94 °C for 30 s, 60 °C for 30 s and 72 °C for
2 min.

The triplicate products were then pooled and 2 µl of pooled product
was added in duplicate to 18 µl of amplification mix 2 containing
2 µl 10× ExTaq buffer (Takara), 1.6 µl 2.5 mM dNTP mix (Takara),
0.4 µl 100 µM T7-MO4 primer (DNA Technology A/S), 0.2 µl ExTaq
polymerase (Takara) and 13.8 µl of nuclease-free water (Ambion) and
cycled for 35 cycles at 94 °C for 30 s, 60 °C for 30 s and 72 °C for
2 min. The duplicate samples were then pooled. Subsequent cDNA was
then cleaned up using Zymo Clean and Concentrate kit and normalized
based on Nanodrop quantification.

Normalized single-cell amplified cDNA was run on the Fluidigm Biomark
EvaGreen qRT-PCR system against custom-designed primer pairs. Data
were analysed using Fluidigm Real-Time PCR Analysis software (v3.0.2)
and samples were normalized to spike Arabidopsis RNA primers LTP4 and
TIM. Principal component analysis was carried out using the prcomp
function of the R statistical computing and graphics language. Cells
were eliminated from PCA if the mouse housekeeping genes had failed
to amplify.
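
The filtering and normalization steps described above can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the authors' pipeline: the ΔCt form of normalization (Ct minus the mean spike-in Ct), the gene labels `Actb` and `B2m` for the housekeeping assays, and the choice to also drop cells with a failed spike-in assay are all assumptions.

```python
from statistics import mean

SPIKE_GENES = ("LTP4", "TIM")    # Arabidopsis spike-in assays (from the text)
HOUSEKEEPING = ("Actb", "B2m")   # assumed labels for the housekeeping assays


def normalise_cells(ct_by_cell):
    """Drop cells with failed control assays and return delta-Ct values.

    ct_by_cell: {cell_id: {gene: Ct or None}}; None marks a failed assay.
    """
    normalised = {}
    for cell, cts in ct_by_cell.items():
        controls = HOUSEKEEPING + SPIKE_GENES
        if any(cts.get(gene) is None for gene in controls):
            continue  # exclude the cell from downstream PCA
        reference = mean(cts[gene] for gene in SPIKE_GENES)
        normalised[cell] = {
            gene: ct - reference
            for gene, ct in cts.items()
            if ct is not None and gene not in SPIKE_GENES
        }
    return normalised


# Hypothetical Ct values for two cells; the second has a failed Actb assay.
cells = {
    "cell_1": {"LTP4": 20.0, "TIM": 22.0, "Actb": 18.0, "B2m": 19.0, "Lgr5": 25.0},
    "cell_2": {"LTP4": 20.0, "TIM": 21.0, "Actb": None, "B2m": 19.0, "Lgr5": 26.0},
}
result = normalise_cells(cells)
```

The resulting per-cell ΔCt table is the kind of matrix that would then be passed to a principal component analysis.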
RNA amplification. For Lgr5/UEA qRT-PCR, cell numbers were limiting so a
modified single-cell RNA amplification protocol was used. Briefly,
1 µl of extracted total RNA underwent first-strand synthesis in a
buffer containing 1 µl 5× SuperScript III buffer (Invitrogen), 0.5 µl
5% NP-40 (Pierce), 0.25 µl 1 mM dNTP mix (Thermo Scientific),
0.075 µl 1 µM MO4d(T) primer (DNA Technology A/S), 0.05 µl 0.1 M DTT
(Invitrogen), 0.25 µl SuperRNaseIN (Ambion), 0.25 µl RNase OUT
(Invitrogen) and 0.625 µl nuclease-free H₂O (Ambion); and then
incubated at 65 °C for 5 min followed by 45 °C for 2 min. Then 0.5 µl
of SuperScript III (Invitrogen) was added to each aliquot and
incubated at 45 °C for 15 min and then 65 °C for 10 min.

One microlitre of exonuclease mix was then added containing 0.2 µl
exonuclease I (Thermo Scientific), 0.6 µl 75 mM MgCl₂ (Ambion) and
0.2 µl nuclease-free H₂O (Ambion) and incubated at 37 °C for 30 min
and 80 °C for 25 min.

Seven microlitres of a reaction mix was then added containing 2.6 µl
5× TdT buffer (Promega), 0.097 µl 100 mM dATP (Thermo Scientific),
0.5 µl RNase H (Invitrogen), 0.5 µl TdT (Promega) and 3.303 µl
nuclease-free H₂O (Ambion) and incubated at 37 °C for 15 min and
70 °C for 10 min.

Sixteen microlitres of amplification mix was then added to 4 µl of
polyadenylated cDNA in triplicate, containing 2 µl 10× ExTaq buffer
(Takara), 0.13 µl 100 mM dATP/dGTP/dCTP/dTTP (Thermo Scientific),
1.65 µl 100 µM MO4d(T) primer (DNA Technology A/S), 0.2 µl ExTaq
polymerase (Takara) and 11.63 µl nuclease-free H₂O (Ambion) and
cycled at 94 °C for 1 min, 50 °C for 2 min and 72 °C for 2 min
followed by 35 cycles at 94 °C for 30 s, 60 °C for 30 s and 72 °C for
2 min.

Subsequent cDNA was then cleaned up using Zymo Clean and Concentrate
kit and normalized based on Nanodrop quantification. qRT-PCR was then
performed as follows using custom-designed 3’ orientated primers.
qRT-PCR. Sybr green qRT-PCR was performed under standard conditions
using a Rotorgene RG3000 (Corbett Research) and data analysed using
Rotorgene 6 software. Custom primers were validated before use using
standard Sybr green qRT-PCR and agarose gel electrophoresis of PCR
products. Samples were normalized to housekeeping genes β-actin or
β2 microglobulin.

TaqMan qRT-PCR was performed under standard conditions as above. 
Samples were normalized to housekeeping genes ribosomal protein L19 or glycer- 
aldehyde 3-phosphate dehydrogenase (Gapdh). 

Antibodies and lectin. Ulex europaeus agglutinin I (UEA)-Alexa Fluor 647, 
1:1,000 (custom conjugation; AbD Serotec). Anti-CD24-Pacific blue, 1:200 
(Biolegend). Anti-substance P, 1:100 (Millipore). Anti-lysozyme, 1:1,000 (Dako). 
Anti-chromogranin A, 1:400 (Abcam). Anti-villin, 1:100 (Abcam). 

Primers used for single-cell expression profiling. Abca1 forward,
5'-TTGTTCCAAAGAGCCATGTG-3'; Abca1 reverse,
5'-GGAATGAGGGCCAATGATAA-3'; Apc forward,
5'-TGGGAGATGGTTGCCAGGGT-3'; Apc reverse,
5'-GTCGGAAAGATGCATCAATGGCCT-3'; Afll forward,
5'-TGATCAAAAATGCACAAAAATTG-3'; Afll reverse,
5'-AGAGCCAGTGAGCATTTGGT-3'; β-actin forward,
5'-AGGTGACAGCATTGCTTCTG-3'; β-actin reverse,
5'-AGGGAGACCAAAGCCTTCAT-3'; β2 microglobulin forward,
5'-ATTCACCCCCACTGAGACTG-3'; β2 microglobulin reverse,
5'-GCTATTTCTTTCTGCGTGCAT-3'; Bmi1 forward,
5'-TCATGGTGTTACCTAAGACAAAAGAC-3'; Bmi1 reverse, 5'-GGTAAAAAGCCTCATCCCAGA-
3'; Cadps forward, 5'-TCACATTTTGCTTTTCTGTTTCA-3’; Cadps reverse, 
5'-CCAGGGTCGCTCAGTTACAG-3’; CD133 forward, 5'-ATCGGGGAAAC 
GAAGAAGTT-3’; CD133 reverse, 5’-ACAGCCGGAAGTAAGAGCAC-3’; CD44 
forward, 5'-CCCCCTTTCTTTTTCCAGTT-3’; CD44 reverse, 5’-ACTTTC 
TGCCCCTCTCCACT-3’; CD82 forward, 5'-CAGGGTAGGCAATTCTTCCA-
3'; CD82 reverse, 5'-AGCTGCCAAGAAACACCAGT-3’'; Cdk5rap2 forward, 
5'-TGCCAAGATGGATGTTCAAA-3’; Cdk5rap2 reverse, 5'-AAGAGCTTCA 
GCAACCTGGA-3'; Chga forward, 5'-GCAGAGGACCAGGAGCTAGA-3’; 
Chga reverse, 5'-CAGGGGCTGAGAACAAGAGA-3'; Chgb forward, 5'- 
ACAGGAAGAAGGCAGACGAA-3'; Chgb reverse, 5'-TCCTTCAGTGAAA 
GGCTCGT-3’; Csrnp1 forward, 5'-GCCCCTCTGTAAGATGGTGA-3’; Csrnp1 
reverse, 5’-GGCACACACACACACACAAA-3’; DCAMKL-1 forward, 5’- 
AGGTGGGCTGGGGACTTGACA-3’; DCAMKL-1 reverse, 5'-TGCAGCAAG 
TGACAAGGGCA-3’; Ddah1 forward, 5'-CCAAACACTAAGGCCGTCAT-3’; 
Ddah1 reverse, 5'-CACACATTGGCTGGAAGATG-3’; Defa5 forward, 5’- 
TCCTGCTCAACAATTCTCCA-3'; Defa5 reverse, 5'-GACACAGCCTGGT 
CCTCTTC-3'; Dll1 forward, 5'-TGAGCCAGTCTTTCCTTGAA-3'; Dll1 reverse,
5'-AGACCCGAAGTGCCTTTGTA-3'; Dll4 forward,
5'-CCCCTTTTGCCAGCCAGGGG-3'; Dll4 reverse,
5'-AGGTTGGTCTCGGGGCAGCA-3'; Dvl3 forward,
5'-ACACCTTAAGCCCACCCTTT-3'; Dvl3 reverse, 5'-GCTCTCTG
GGGTTCTGATTG-3’; EphB2 forward, 5'-TCCTCCCTGTCTGCTCTCAT-3’; 
EphB2 reverse, 5'-CTGGGGTGTCACAGTGAATG-3’; EphB4 forward, 5’-TCC 
TGACTTGCTGTCACCAC-3'; EphB4 reverse, 5’-GAACCAGGTGCCCTTT 
AACA-3'; Fgfr2 forward, 5'-TCGAAGGATGGCAAAAGCCCAGC-3'; Fgfr2
reverse, 5'-AGTGAGTGGGCGTATCCAAAGCA-3'; Frs2 forward,
5'-TCTACAGTGGGACTACCATTGCCA-3'; Frs2 reverse,
5'-ACAGTGGTTACGTTTGCCAACCCA-3'; Gip forward, 5'-GGGAAAGGAGGACAAAGAGG-3';
Gip reverse, 5'-AGCCAAGCAAGCTAAGGTCA-3’; Kif20a forward, 5'-ACAG 
GAACAGCAGTGGGAAC-3'; Kif20a_ reverse, 5’-CACTAACTGGGTGCCA 
TGTG-3'; Lgr5 forward, 5’-GGGCGTTAAGTCCACTGTGT-3’; Lgr5 reverse, 
5'-CGAACACCTGCGTGAATATG-3’; Lrigl forward, 5’-TTCCTTACCGGTG 
AGACTGG-3'; Lrigl reverse, 5'-CCATCACTGTGCCAACACTT-3’; LTP4 
forward, 5'-GTGCTGTGCAGGAGTCAAAA-3’; LTP4 reverse, 5'-TCTTCA 
GGCAAATGATGTCG-3'; Mcm6 forward, 5’-CTGAAAGGCTCCAGTGAA 
GG-3'; Mcm6 reverse, 5'-TCAAGCATTGCTCCAGACAC-3’; Mctp2 forward, 
5'-CATCTGGAGCGTAACCCCTA-3’; Mctp2 reverse, 5'-CAGTCACAAAAG 
CAGGTGGA-3'; Mmp7 forward, 5'-CCCGGTACTGTGATGTACCC-3';
Mmp7 reverse, 5'-AATGGAGGACCCAGTGAGTG-3’; Tert forward, 5'-AGCC 
GCACATTGGCTCTGCT-3’; Tert reverse, 5’-TCGCCTCTGGGAGCTTCCGG- 
3’; Muc2 forward, 5'-GTCCGAAGTGTTACCCTGGA-3'; Muc2 reverse, 5'- 
CCAGGAGTGGAGAAGGTCAG-3’; Nfat5 forward, 5’-ACTGACCTGCCTT 
CTTGCAT-3'; Nfat5 reverse, 5'-TTGACTGCAGCTGTTTACAGAAA-3’; 
Nfatc1 forward, 5'-TCATTTGCTCTGCACCAGTC-3’; Nfatc1 reverse, 5’-GAC
GAGAGTCAGGGAAGCAG-3’; Nfatc3 forward, 5'-TGGGCTCAAGATGA
AACCTT-3’; Nfatc3 reverse, 5’-TAACTGAGGAGGAGCCTGGA-3’; Ngn3 
forward, 5'-ATCTGCCGGCCTCCGACCAT-3’; Ngn3 reverse, 5'-GCCTGG 
TCTCCCTTGGGGGA-3'; Notch1 forward, 5'-TCAGTGTGACCCAGACC 
TTG-3’; Notch1 reverse, 5'-CAAAAGGCCAGAAAGAGCTG-3’; Pax6 forward, 
5'-GAACAACACAGGCTGTTGGA-3’; Pax6 reverse, 5'-TGTGTGTTGTCC 
CAGGTTCA-3’; Pbx1 forward, 5’-AGAAAGCCATGTGGTTGGAC-3’; Pbx1 
reverse, 5'-TGAAAAGGGTGCCTTTCATC-3’; Rfx6 forward, 5’-TTTTCCT 
CCCTCTTTGGTTTC-3’; Rfx6 reverse, 5'-GCACAGGGGTAGAAGGTCAA- 
3’; Ssh2 forward, 5'-CCACAGCCCACCAAGTATTC-3’; Ssh2 reverse, 5'- 
GGAAGGCCTCCTTCAGAACT-3'; Tac1 forward, 5’-GATGAAGGAGCT
GTCCAAGC-3’; Tac1 reverse, 5'-TCACGAAACAGGAAACATGC-3'; TIM
forward, 5’-ACGGAAATCGCTAGTCTCCA-3’; TIM reverse, 5’-TGTCAAAC 
CAGAGCTCACGA-3’'; Tph1 forward, 5'-AATTTGCCAAGACCGTGAAG-3’; 
Tph1 reverse, 5'-GCCCTGGCTCTAGACTGATG-3’; villin forward, 5'-ATT 
AGCGTCTGGGGGTTTCT-3’; villin reverse, 5’-CGTAGCAAACCCATGTT 
CCT-3'; Zbtb48 forward, 5’-CACATGGAGATCCACGACAG-3’; Zbtb48
reverse, 5'-CAGGGACTCCACAATGACCT-3’. MO4d(T) primer, 5’-AAGCA
GTGGTATCAACGCAGAGTGGCCATTACGGCCGTACTTTTTTTTTTTTT-
TTTTTTTTTTTTTTTTT-3’; T7-MO4 primer, 5'-GGCCAGTGAATTGTAA
TACGACTCACTATAGGGAGGCGGAAGCAGTGGTATCAACGCAGAGTG 
GCCATTACGGCCGTAC-3’. 

Taqman probes. Atoh1, Mm00476035_s1; Chgb, Mm00483287_m1; Chga,
Mm00514341_m1; Clca3, Mm01320697_m1; Kit, Mm03053277_s1; Gapdh,
Mm03302249_g1; Hopx, Mm00558630_m1; Lgr5, Mm00438890_m1; Lrig1,
Mm00456116_m1; Lyz1, Mm00657323_m1; Mmp7, Mm00487724_m1; Muc2,
Mm00458299_m1; Rpl19, Mm01606039_g1; Tert, Mm00436931_m1; villin,
Mm00457074_m1. 
Crystal structure of an RNA-bound
11-subunit eukaryotic exosome complex 


Debora Lika Makino¹, Marc Baumgärtner¹ & Elena Conti¹


The exosome is the major 3’-5’ RNA-degradation complex in eukaryotes. The ubiquitous core of the yeast exosome 
(Exo-10) is formed by nine catalytically inert subunits (Exo-9) and a single active RNase, Rrp44. In the nucleus, the 
Exo-10 core recruits another nuclease, Rrp6. Here we crystallized an approximately 440-kilodalton complex of 
Saccharomyces cerevisiae Exo-10 bound to a carboxy-terminal region of Rrp6 and to an RNA duplex with a 
3’-overhang of 31 ribonucleotides. The 2.8 Å resolution structure shows how RNA is funnelled into the Exo-9 channel
in a single-stranded conformation by an unwinding pore. Rrp44 adopts a closed conformation and captures the RNA 
3’-end that exits from the side of Exo-9. Exo-9 subunits bind RNA with sequence-unspecific interactions reminiscent of 
archaeal exosomes. The substrate binding and channelling mechanisms of 3’-5’ RNA degradation complexes are
conserved in all kingdoms of life.


RNA degradation serves a multitude of functions in all domains of 
life. RNA decay modulates the abundance of transcripts and is the 
ultimate step in quality control pathways that detect and eliminate 
defective RNA molecules'”. Degradation is also used to partially trim 
RNA precursors to their mature form’. The main cellular machinery 
responsible for degrading RNA molecules in the 3’-to-5’ direction is the 
RNA exosome complex'*. The exosome is conserved in all eukaryotes 
studied so far. Related complexes are also present in bacteria and 
archaea, highlighting the importance of this ancient molecular machine’. 

The common core of eukaryotic and archaeal exosomes is a barrel-like 
structure of nine polypeptides (Exo-9)'”. Exo-9 consists of six subunits 
with structural homology to RNase PH (a phosphorolytic 3’-to-5' exor-
ibonuclease), and a ring of three subunits with S1 and KH domains (folds
typically found in RNA-binding proteins)**. In prokaryotes, RNA sub- 
strates use the central channel to reach the three active sites confined in 
the interior of the RNase PH barrel’. In yeast and human Exo-9, how- 
ever, all RNase PH-like subunits contain amino acid changes in their 
putative active sites that render them inactive**’°. The processive activity
of eukaryotic exosomes resides in an additional subunit, Rrp44 (also 
known as Dis3)*°"°. Rrp44 associates with Exo-9 to form the catalytically 
active Exo-10 complex that is ubiquitously present in the cytoplasm and 
nucleus of eukaryotic cells. Exo-10 recruits additional factors and regu- 
lators that act specifically in either the cytoplasm or the nucleus'”. 

All Exo-10 subunits are essential in vivo in yeast*'!"’. Yet, Rrp44 
can bind and degrade RNAs in vitro in the absence of Exo-9 (refs 6, 10, 
13), raising questions about the function of this catalytically inert 
scaffold. We previously reported in vitro biochemical experiments 
suggesting that the channel in yeast Exo-9 is used to deliver RNA 
substrates to the processive RNase site of the tenth subunit'*. The 
Exo-9 channel was recently shown to be essential in vivo in yeast". 
To shed light on the molecular mechanisms of RNA binding and 
processing, we have determined the atomic structure of a complete 
S. cerevisiae exosome core bound to an RNA substrate trapped in a 
pre-cleavage state within the complex. 


Rrp6 stabilizes an exosome-RNA complex 


We reconstituted S. cerevisiae exosome complexes containing Exo-9
and an Rrp44 mutant lacking endonucleolytic and exonucleolytic
activities (Rrp44(D171N/D551N))°"*'*"8 (Supplementary Fig. 1a, b).
Within Exo-9, the RNase PH-like proteins (Rrp41, Rrp45, Rrp46, 
Rrp43, Mtr3 and Rrp42) and two of the S1/KH ‘cap’ proteins (Rrp40 
and Rrp4) form a stable structural unit, whereas the third cap protein 
(Csl4) is a more labile component of the complex’”*”’. To stabilize the 
complex for structural analysis, we added an eleventh subunit, Rrp6 
(refs 21, 22). In in vitro RNase protection assays, Rrp6 binding 
to Exo-10 slightly but reproducibly increased the intensity of the 31- 
33-nucleotide fragments that correspond to a long RNA-binding path 
in Exo-10 (ref. 14) (Supplementary Fig. 1c). Concomitantly, Rrp6 
decreased the intensity of the 11-12-nucleotide fragments that reflect 
an alternative short RNA-binding path’* (Supplementary Fig. 1c). 
A construct encompassing only the C-terminal region of Rrp6 (that
is, lacking the nuclease region) had a similar stabilizing effect in RNase
protection assays to the full-length protein (Supplementary Fig. 1c).
We concluded that the C-terminal exosome-binding region of Rrp6 
(ref. 23) contributes either directly or indirectly to the long RNA- 
binding path of Exo-10. 

We narrowed down the exosome-binding region of Rrp6 to resi-
dues 518-693 (hereafter referred to as Rrp6Cterm) (Supplementary
Fig. 1c and data not shown). The reconstituted Exo-10 complex
bound to Rrp6Cterm crystallized in the presence of a
5’-CCCCCGAGAGGGGGU₃₁-3’ RNA. This RNA substrate consists of a
single-stranded 3'-overhang (designed to span the size of the frag-
ments protected by Exo-10 in RNase protection assays") and a 5’-
CG duplex closed by a tetraloop (designed to stall at the narrow
entrance of the Exo-9 channel”). We determined the structure of
Exo-10-Rrp6Cterm-RNA by molecular replacement using the coor-
dinates of human Exo-9 (ref. 6) and yeast Rrp44-Rrp41-Rrp45 (ref.
14). The structure was refined to 2.8 Å resolution with good statistics
(Rfree value of 22.4%) and stereochemistry (Supplementary Table 1).
The atomic model is complete for most of the polypeptides (Sup-
plementary Fig. 1a). RNA has defined electron density for 17
nucleotides in the single-stranded overhang (nucleotides 1-15 
and 30-31, as numbered from the 3’-end), and 13 nucleotides in 
the duplex (32-44). Additional weak density is present that corre- 
sponds to partially disordered oligonucleotides in the wider parts of 
the channel. 
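
The nucleotide numbering used above can be checked with a short sketch. It assumes only what the text states: the substrate is a 14-nucleotide hairpin (5'-CCCCCGAGAGGGGG) followed by a 31-uridine overhang, and positions are counted from the 3'-end. The function and variable names are illustrative and do not come from the paper.

```python
# Sketch: rebuild the crystallization RNA and number it from the 3' end,
# following the convention in the text (U1 is the 3'-terminal nucleotide).
HAIRPIN = "CCCCCGAGAGGGGG"  # 5'-CCCCC GAGA GGGGG: stem, tetraloop, stem
OVERHANG = "U" * 31         # single-stranded 3' overhang


def build_rna() -> str:
    """Return the full substrate written 5'->3'."""
    return HAIRPIN + OVERHANG


def number_from_3prime(seq: str) -> dict:
    """Map position (1 = 3'-terminal nucleotide) to its base."""
    return {i + 1: base for i, base in enumerate(reversed(seq))}


rna = build_rna()
numbering = number_from_3prime(rna)

# Positions reported as ordered in the electron density.
ordered_overhang = list(range(1, 16)) + [30, 31]  # nucleotides 1-15 and 30-31
ordered_duplex = list(range(32, 45))              # nucleotides 32-44

print(len(rna))                                     # total length of the substrate
print(numbering[32], numbering[45])                 # the G32-C45 base pair
print(len(ordered_overhang), len(ordered_duplex))   # 17 and 13 nucleotides
```

With this numbering, position 32 is the last G of the hairpin and position 45 is the 5'-terminal C, which matches the G32-C45 base pair discussed for the entry pore.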
Structure of yeast Exo-10-Rrp6-RNA


The structure of the Exo-10-Rrp6Cterm-RNA complex shows the
canonical overall architecture of Exo-9, with the ring of the cap
proteins bound on top of the RNase PH barrel (Fig. 1). Rrp6 folds
into two separate stretches that bind S1/KH and RNase PH-like
proteins (Fig. 1, right). The first (amino-terminal) stretch (residues
532-557) forms an α-helix that binds to Csl4 and Mtr3 (Fig. 2a and
Supplementary Fig. 2a). The second (C-terminal) stretch (residues
565-619) contains an unstructured region that is followed by a small
β-hairpin and α-helix that contact Mtr3, Rrp43 and approach Csl4
(Fig. 2a and Supplementary Fig. 2a). Although the exosome-binding 
region of Rrp6 shows altogether relatively weak sequence conser- 
vation, the second stretch of Rrp6 in particular docks onto a conserved 
surface of Exo-9 (Fig. 2a), suggesting that at least part of the Rrp6- 
binding site on the exosome is conserved across species. The 
C-terminal region of Rrp6 does not contact RNA directly. The struc- 
tural analysis suggests that it indirectly contributes to the long 
RNA-binding path by stabilizing Csl4 in an appropriate RNA-binding 
conformation (Fig. 2b and Supplementary Fig. 2b). This also rationa-
lizes previous reports that Rrp6 binding increases the exoribonuclease
activity of Exo-10 independently of Rrp6 activity’’.

Rrp44 contains an endoribonuclease (the PIN domain) and an exor-
ibonuclease region (with the CSD1, CSD2, RNB and S1 domains typ-
ical of RNase II-like enzymes”) (Fig. 3a). Rrp44 binds the RNase PH
barrel on the opposite side from the S1/KH ring, but is offset from the
central axis of Exo-9 by about 50° (Fig. 1, left). It adopts a closed
conformation (in which the PIN domain contacts CSD2) that is dif-
ferent from the open conformation observed in the yeast Rrp41-
Rrp45-Rrp44 structure (in which the PIN domain contacts CSD1
and S1)"* (Fig. 3b, left). The PIN domain is anchored on the RNase
PH proteins at a similar position as observed in the Rrp41-Rrp45-
Rrp44 complex, with the endonuclease site exposed to solvent
(Supplementary Fig. 3). The relative displacement of Rrp44 can
roughly be described as large rigid-body rotations of the exoribonu-
clease region with respect to the Exo-9-PIN module (Fig. 3b). A helix
of CSD1 seems to be uncoupled from the rest of the exoribonuclease
region and obstructs the exit of the central channel at the bottom of the
RNase PH barrel (Fig. 3b). Finally, the N-terminal region of Rrp44,
which was disordered in previous structures”, forms a long β-hairpin
that is wedged in between Rrp41-Rrp42 and approaches the N
terminus of the cap protein Rrp4 (Supplementary Fig. 3b).


Figure 1 | The crystal structure of a yeast exosome-RNA complex. a, b, The
structure of S. cerevisiae Exo-10-Rrp6Cterm-RNA is shown as ribbon (a) and
surface (b) representations in two orientations related by a ~90° rotation
around a vertical axis. RNA is in black, together with the simulated annealing
omit map (contoured at 2.5σ). The exosome-binding region of Rrp6 wraps
around Csl4 and contacts the RNase PH ring (right). The Rrp4 N-terminal tail
reaches across the RNase PH-like barrel towards the N-terminal region of
Rrp44 (left). RNA is enclosed in the complex (b).


RNA duplex unwinding at the entry pore

The RNA we used for crystallization binds Exo-10 with the 5’-duplex
sandwiched between the cap proteins Rrp4 and Rrp40 (Fig. 2b). In
comparison with the structure of human apo Exo-9 (ref. 6), Rrp4 and
Rrp40 are only ~4 Å closer to the centre of the barrel and thus seem to
be almost pre-positioned to bind the RNA duplex (Supplementary
Fig. 2b). The cap protein Csl4 presents the largest (~10 Å) conforma-
tional difference compared to apo human Exo-9 (ref. 6). Csl4 does not
contact the RNA duplex directly, but contacts the single-stranded
RNA (ssRNA) at the entrance of the RNase PH channel (Fig. 2b).


Figure 2 | Interaction between Exo-9 and Rrp6. a, Rrp6Cterm binds to
conserved surface patches of Exo-9. The surfaces of Csl4 (yellow), Mtr3 (blue)
and Rrp43 (green) are coloured according to sequence conservation. The
zoomed-in views show direct interactions between Exo-9 and Rrp6. b, The
structure of yeast Exo-10-Rrp6Cterm-RNA is shown in a top-view orientation.
The RNase PH ring and Rrp6 are shown as a surface in grey and red,
respectively. The S1, KH, N-terminal (NTD) and C-terminal (CTD) domains
of the cap proteins are indicated. The atomic model and additional density for
RNA are in black.


Figure 3 | Conformational rearrangements of Rrp44. a, Domain
organization of Rrp44. The D171 endonuclease and D551 exoribonuclease sites
are circled in yellow and red, respectively. b, On the left, schematics based on
the crystal structure of Exo-10-Rrp6Cterm-RNA (top) and a model of Exo-10
(refs 14, 20) (bottom). The cap proteins are in yellow or orange, the RNase PH
core is in light grey and Rrp6 is in dark grey. The domains of Rrp44 are coloured
as in a. Two helices of Rrp45 and Rrp44 are shown as a reference. On the right
are the crystal structures of the corresponding Rrp44 exoribonuclease regions.


Starting from the 5’-end of the RNA, the first direct interaction
with the exosome occurs at the last base pair of the duplex (G32-C45)
(Fig. 4a). On the 3’-strand, the phosphate backbone of G32 binds
Rrp4 Arg 123 and the base stacks against Rrp40 Arg 110 (Fig. 4b).
The complementary nucleotide on the 5’-strand is almost completely
disordered: in the base-pair conformation, C45 would stereochemi-
cally clash against a conserved loop of Rrp40 (Supplementary Fig. 4a).
Thus, the first base pair of the duplex is melted just before entering the
Exo-9 channel by the structural features of the cap proteins.

RNA is then threaded in a single-stranded conformation through 
the narrow entrance pore (10-12 Å across) by sequential base-
stacking interactions and a base-flipping mechanism (Fig. 4a). The
base of U31, the first nucleotide of the single-stranded overhang, flips
out to be sandwiched by π-π stacking interactions between the side
chains of Rrp4 Phe 77 and Rrp41 Arg 119 (Fig. 4b). The base of the
following nucleotide, U30, interacts with Rrp40 Phe 80 and a loop of
Rrp41 (at Lys 62 and Ser 63) (Fig. 4b, c). Consistently, negatively
charged substitutions at these residues in Rrp41 abolish the long 
RNA-binding path in in vitro assays’* and a negatively charged inser- 
tion at this position is lethal in vivo’. 


Binding of ssRNA in the internal channel 


After U30, the sparse density for ssRNA (left uninterpreted in the final 
atomic model) is in close proximity to positively charged residues of 
the Csl4 S1 domain and to loop regions of the Rrp41-like proteins 
Rrp46 and Mtr3 (Fig. 4c). This binding site corresponds to the posi- 
tion of a bound nucleotide in an archaeal exosome** (Supplementary 
Fig. 4b, left). Below this point, the internal cavity widens and the 
density for RNA becomes correspondingly more diffuse, suggesting 
the presence of multiple conformations and/or weak binding (Fig. 4d). 
We estimate that about eight nucleotides can span this part of the 
channel, which narrows again roughly halfway through the RNase 
PH barrel by the juxtaposition of positively charged residues of 
Rrp42 and of the N terminus of Mtr3 (Fig. 4d). The RNA channel 
then veers sideways. Here, the backbone of U15-U13 is bound by 
positively charged residues of Rrp41 (Arg 95) and Rrp45 (Arg 71,
Arg 106, Arg 113 and Arg 114) (Fig. 4a, e). Consistently, charge-swap
mutations or a negatively charged insertion at these residues abolish
the long RNA-binding path in in vitro assays'* or cause lethality
in vivo’. In the archaeal exosome, this sideways channel ends with
the phosphorolytic active site of Rrp41 (ref. 24) (Supplementary Fig. 
4b, right). In the yeast exosome, the channel continues in Rrp44. 
Figure 4 | The RNA path through the exosome. a, Schematics showing the
protein-RNA interactions in the Exo-10-Rrp6Cterm-RNA structure. Polar
interactions are highlighted with dotted lines, and stacking interactions with
thick black lines. b-g, Zoomed-in panels relative to the overall structure shown
in the centre. The subunits are indicated in coloured circles. Density not
interpreted in the final model is shown (as black mesh) as a difference omit map
contoured at 2.5σ. The RNA-binding sites at the entrance (c) and exit (e) of
Exo-9 are similar to those observed with the archaeal exosome™*
(Supplementary Fig. 4).


The N-terminal region of Rrp41, a loop of Rrp43, the N-terminal region
of Rrp44 (Arg 26 and Arg 28) and the PIN domain of Rrp44 (Tyr 126 and
Pro 127) contact nucleotides U13-U10 (Fig. 4a, f). The finding that the
PIN domain also contributes residues to the long RNA-binding path
(albeit from a surface opposite to that of the endonuclease site) rationalizes
recent in vivo cross-linking data suggesting that the exonuclease and PIN
domains of Rrp44 act together to degrade RNA substrates”*. The RNA
channel ends in the exoribonuclease. Similar to the structure of Rrp44 in
isolation’, the U6-U9 nucleotides bind at the interface between the CSD1
and catalytic domains of Rrp44 (Fig. 4a, f), and the five nucleotides at the
3'-end (U1-U5) bind in the catalytic region (Fig. 4a, g).


The RNA-binding path through the exosome


The long RNA-binding path in the internal chamber of the exo-
some (between the entrance pore of Exo-9 and the exoribonuclease
site of Rrp44) extends over 160 Å and is spanned by a minimum of
25 nucleotides in a single-stranded conformation. In this path,
the single-stranded overhang of the RNA is almost entirely embedded 
in the interior of the complex and shielded from solvent (Fig. 1b). 
A notable exception is at nucleotides U6-U7, which reside in a 
groove of Rrp44 that is accessible to solvent (Supplementary Fig. 5). 
It is possible that this surface groove allows RNA to access the 
exoribonuclease site directly, rationalizing the short 11-12-nucleo-
tide-binding path observed in RNase protection assays’* (Sup-
plementary Fig. 1c).

Notably, all Exo-9 subunits in addition to Rrp44 contribute directly 
to bind the phosphate groups along the RNA backbone. The electro- 
static interactions are not sequence specific, in line with the ability of 
the exosome to degrade a wide variety of substrates. The interactions 
with RNA are clustered at the substrate entry pore and degradation 
site. The central portion of the RNA in the channel lacks defined
binding sites. Here, however, the barrel-like shape of the molecule
confines the substrate. The presence of strong and weak binding sites
as well as the topological restraints rationalize the processive nature
of this molecular machine. Topological constraints also promote
strand separation of a duplex RNA at the 5’-end of the binding path.
Complete unwinding is achieved by translocation: RNA binding to
Rrp44 at the U6-U7 pockets seems to be sufficient to drive a duplex
through the unwinding pore even before 3’-end hydrolysis (Sup-
plementary Fig. 6).


Figure 5 | RNA channelling to degradation is a conserved mechanism.
a, Archaeal exosome structure”™* with RNA (in black) with a nucleotide bound at
a phosphorolytic site and at the entrance of the exosome (with the
corresponding Rrp41 subunit in blue, Rrp42 in green, Rrp4 in orange, and the
other subunits in grey for clarity). b, Yeast exosome structure shown in a similar
orientation to the archaeal exosome in a. Rrp41 is in blue, Rrp45 in green (an
Rrp42-like subunit), Rrp4 in orange, the other RNase PH subunits are in grey,
and RNA is in black. The eukaryotic and prokaryotic exosomes share the same
RNA-binding path within Exo-9.


Conclusions


The structure of the yeast exosome bound to a substrate trapped in a
pre-cleavage state within the long RNA-binding path shows how the
complex encages and prepares the ribonucleotide chain for degrada-
tion. RNA enters the internal chamber present in the exosome in
an unwound conformation, reminiscent of how polypeptides are
unfolded when entering the proteasome”’. The nucleic acid traverses
roughly half of the barrel-like structure of Exo-9 and is then chan-
nelled to the side, to the same binding path that in prokaryotic exo-
some-like complexes leads to a phosphorolytic site (Fig. 5a). In yeast,
in the absence of such a catalytic site, the path continues uninter-
rupted within the Rrp44 nuclease, which curls up on the side of
Exo-9 and grabs the RNA 3’-end (Fig. 5b). Although the chemistry
of the reactions is different, the channelling mechanism of exosome
complexes is remarkably conserved from prokaryotes to eukaryotes
and parallels the mechanism used by the proteasome to confine and
prepare polypeptides for processive degradation.


METHODS SUMMARY


S. cerevisiae exosome subunits were expressed and purified to homogeneity with
similar methods to those previously described*'*”* and detailed in the Methods.
For crystallization, yeast exosome complexes were reconstituted by combining
Exo-9, Rrp44 and His6-SUMO-Rrp6(518-693). After cleavage of the His6-SUMO
tag and further purification by size-exclusion and anion-exchange chromato-
graphy, Exo-10-Rrp6(518-693) was incubated in a 1:2 (mol:mol) stoichiometry with
a pre-annealed 5'-CCCCCGAGAGGGGGU₃₁-3' RNA oligoribonucleotide
(Sigma Aldrich). Needle-like crystals of 20 × 50 × 2,500 µm³ were grown as
described in the Methods, and diffracted to 2.8 Å resolution (data processing
statistics are summarized in Supplementary Table 1). The structure was solved
by molecular replacement and extensive model building and refinement.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Expression and purification. Bacterial expression and purification of Exo-9
subunits were primarily based on the procedures established previously®.
Exosome proteins were overexpressed in BL21 (DE3) gold pLysS (Stratagene)
or BL21 (DE3) T7 Express Iq (New England Biolabs) cells. Cells were grown at
37 °C in TB medium and induced with isopropyl-β-D-thiogalactoside (IPTG)
overnight at 18 °C.

Protease inhibitors, namely phenylmethylsulphonyl fluoride (PMSF), 4-(2-aminoethyl)
benzenesulphonyl fluoride hydrochloride (AEBSF) or cOmplete protease inhi-
bitor cocktail (Roche Applied Science), were added to the cell lysate, which was
loaded onto an affinity column (GSH (GE Healthcare) or TALON cobalt (Takara
Bio)). This was followed by overnight cleavage with tobacco etch virus (TEV) or
small ubiquitin-like modifier (SUMO) protease (except for Rrp6 and Csl4 con-
structs), after which the sample was loaded on TALON cobalt a second time to
remove uncleaved species. On-bead TEV cleavage was done on glutathione
S-transferase (GST)-Rrp40 protein. After affinity purification, all samples were
submitted to ion-exchange chromatography (HiTrapQ HP or HiTrapSP HP from
GE Healthcare), and, with the exception of Csl4 and of Rrp40, proteins were
further purified by size-exclusion chromatography (Superdex 200 or 75 from GE
Healthcare). For Rrp44 purification, heparin and MonoQ 4.6/100 PE columns
(both GE Healthcare) were added to the purification protocol. In particular,
MonoQ chromatography was important to remove nucleic acid-bound Rrp44
species. All purifications were performed at pH 7.5.

Exosome reconstitution. Equimolar amounts of each Exo-9 component were 
combined, incubated at room temperature for 30 min, and concentrated for size- 
exclusion chromatography (Superdex 200, GE Healthcare) in buffer containing 
20 mM Tris, pH 7.5, 100 mM NaCl, 10% glycerol and 2 mM dithiothreitol (DTT). 
The main peak was pooled and injected onto a MonoQ anion exchange column. A 
shallow linear gradient separated Exo-9 sub-complexes from the sample”. After 
quantifying the purified Exo-9 amounts, we added a 1.3-fold excess molar ratio of 
Rrp44 (pre-run on a MonoQ column to remove nucleic acid-bound species), a 
fourfold excess of SUMO-tagged Rrp6 and a 2.5-fold excess of Csl4. The mixture 
was then incubated in the presence of 1:50 (w/w) (protease:Rrp6) SUMO protease 
at room temperature for over 1 h. We then added zinc acetate to a final concen-
tration of 0.2 mM. A final round of Superdex 200 and MonoQ chromatography
removed further protein excess, impurities and subcomplexes. The peak was
pooled, buffer-exchanged to a final exosome solution containing 20 mM Tris,
pH 7.5, 50 mM NaCl, 1 mM tris(2-carboxyethyl)phosphine hydrochloride
(TCEP) and concentrated to 20-30 mg ml⁻¹ for storage at −80 °C. The entire
reconstitution process was done under RNase-free conditions. 
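
The molar excesses quoted above translate into component amounts as in the following sketch. It assumes that an "x-fold excess" means x times the molar amount of purified Exo-9, which is one reading of the protocol; the function names and the example quantities are hypothetical.

```python
# Fold excesses over Exo-9 quoted in the text. Treating them as simple
# multipliers of the Exo-9 molar amount is an assumption of this sketch.
FOLD_EXCESS = {"Rrp44": 1.3, "SUMO-Rrp6": 4.0, "Csl4": 2.5}


def reconstitution_amounts(exo9_nmol: float) -> dict:
    """Return the nmol of each added component for a given nmol of Exo-9."""
    if exo9_nmol <= 0:
        raise ValueError("Exo-9 amount must be positive")
    return {name: round(fold * exo9_nmol, 3) for name, fold in FOLD_EXCESS.items()}


def sumo_protease_mass(rrp6_mass_ug: float, ratio: float = 50.0) -> float:
    """Mass of SUMO protease for a 1:50 (w/w) protease:Rrp6 ratio."""
    return rrp6_mass_ug / ratio


if __name__ == "__main__":
    # Hypothetical example: 10 nmol of purified Exo-9.
    for name, nmol in reconstitution_amounts(10.0).items():
        print(f"{name}: {nmol} nmol")
    # Hypothetical example: 100 micrograms of SUMO-tagged Rrp6.
    print(f"SUMO protease: {sumo_protease_mass(100.0)} micrograms")
```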

RNase protection. Exo-10 (500 nM) was added to [³²P]-body-labelled ssRNA™
(250 nM) and to increasing concentrations of SUMO-tagged Rrp6 (750, 1,000
and 2,000 nM). Reaction buffer contained 50 mM HEPES, pH 7.5, 50 mM NaCl,
5 mM EDTA, pH 8.0, 0.1% NP-40, 10% glycerol and 1 mM DTT to a final volume
of 20 µl. After incubation at 4 °C for more than 1 h, unprotected RNA was
digested with 0.5 µl RNase A/T1 Mix (Thermo Scientific, containing 2 mg ml⁻¹
of RNase A and 5,000 U ml⁻¹ of RNase T1) for 20 min at 20 °C. The reaction was
quenched with 10 mM EDTA and 1% SDS. RNA products that were protected by
the exosome were extracted by two rounds of phenol:chloroform:isoamyl alcohol
(25:24:1 volume ratios from Invitrogen) followed by ethanol precipitation.
These reaction products were resolved by 20% (w/v) PAGE containing 7 M urea,
and detected using Typhoon FLA 7000 (GE Healthcare). Band intensity profiles
were evaluated using ImageJ”’.

Crystallization. Exo-10-Rrp6Cterm (18 mg ml⁻¹) was incubated with a twofold
molar excess of pre-annealed RNA (5’-CCCCCGAGAGGGGGU₃₁-3’) (Sigma-
Aldrich) in buffer containing 20 mM Tris, pH 7.5, 50 mM NaCl, 1 mM MnCl₂,
2 mM MgCl₂ and 1 mM TCEP. Crystallization conditions were optimized from
the PACT screen (Qiagen) (0.1 M Bis-Tris propane, pH 6.5, 0.2 M NaBr and 20%
PEG3350) to 0.15 M MES, pH 6.5, 0.27 M NaBr and 11.4-12.2% PEG3350 using
sitting-drop Cryschem plates (Hampton Research). Long needle-like crystals
(of up to 20 × 50 × 2,500 µm³) with the best diffraction power grew within
2 weeks at 19 °C. Crystals are of monoclinic space group P2₁ with cell dimensions
of a = 154.09 Å, b = 107.44 Å, c = 150.46 Å and β = 110.63°.
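
As a quick consistency check on the cell dimensions, the volume of a monoclinic unit cell follows from the standard relation V = a·b·c·sin β. The sketch below applies it to the values quoted above; the function name is illustrative, and no solvent content is derived because the number of complexes per cell is not stated here.

```python
import math


def monoclinic_cell_volume(a: float, b: float, c: float, beta_deg: float) -> float:
    """Volume (in cubic angstroms) of a monoclinic cell: V = a * b * c * sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


# Cell dimensions quoted in the text (angstroms and degrees).
volume = monoclinic_cell_volume(154.09, 107.44, 150.46, 110.63)
print(f"Unit-cell volume: {volume:.3e} cubic angstroms")
```

The result is on the order of 2.3 million cubic ångströms, which is plausible for a crystal of a complex of roughly 440 kilodaltons.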

Data collection and processing. Exo-10-RNA-Rrp6(518-693) crystals were flash-
frozen in liquid nitrogen after exchanging mother liquor with 0.1 M MES, pH 6.5,
0.27 M NaBr, 25% (w/v) PEG3350 and 10% (v/v) glycerol. Data collection was
carried out at beamline X10SA at the Swiss Light Source, equipped with a Pilatus
6M detector (Dectris). The detector was set at a distance of 600 mm with adjusted
beam stop position to allow detection of lower resolution spots and the wave-
length was set to the Br edge of 13.478 keV (0.9199 Å). As crystals frozen at 100 K
decayed rapidly under X-ray radiation, only 8-15° sweeps of contiguous data
could typically be collected before resolution loss. More than 160 sub-data
sets were collected, from which the best images were merged to a final data set
that is 99.5% complete in the 3.5-3.2 Å resolution shell and 56.5% complete in the
3.0-2.8 Å highest resolution shell (Supplementary Table 1). In the final processed
data, Bijvoet pairs were treated as identical despite the weak anomalous signal from
bromide ions (|F(+) − F(−)|/σ = 1.192 at 8 Å) observed when treated as
unique. Statistical quality measures of the two data sets were basically unchanged,
with overall Rmerge values of 17.4% and 16.3% for the data treated as native and as
anomalous, respectively. The structure was ultimately refined against data con-
taining merged Bijvoet pairs, as there was a gain in completeness of nearly 10% in
the highest resolution shell (3.0-2.8 Å). Data were processed using XDS”.

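
The wavelength quoted for the bromine edge can be reproduced from the photon energy with the standard relation λ = hc/E. The sketch below uses hc ≈ 12.398 keV·Å; the function name is illustrative.

```python
# Product of Planck's constant and the speed of light, in keV * angstrom.
HC_KEV_ANGSTROM = 12.398419843


def kev_to_angstrom(energy_kev: float) -> float:
    """Convert an X-ray photon energy in keV to a wavelength in angstroms."""
    if energy_kev <= 0:
        raise ValueError("energy must be positive")
    return HC_KEV_ANGSTROM / energy_kev


wavelength = kev_to_angstrom(13.478)
print(f"{wavelength:.4f} angstroms")
```

Rounded to four decimal places, this agrees with the 0.9199 Å given in the text.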
Structure determination and refinement. Initial phases were calculated by 
molecular replacement using subunits of human Exo-9 (ref. 6) (Supplementary 
Table 2), Sulfolobus solfataricus Rrp41/42 (ref. 24), S. cerevisiae Rrp41/45/44,95 
(ref. 14), S. cerevisiae Rrp44,qo5 (ref. 13) and the C terminus of S. cerevisiae Rrp40 
(ref. 31) crystal structures as search models. Not all protein components were
found by the likelihood-enhanced rotation and translation functions applied by
Phaser” from the CCP4 package”’. Several domains, including the CSD2 and S1
domains of Rrp44, all domains of the Csl4 and Rrp4 subunits, and the N-terminal
domain of Rrp40, as well as Rrp6 and the RNA, were placed manually as phases
were slowly improved by several cycles of model building (Coot*) and rigid-body,
group B-factor and TLS refinements (PHENIX**). The final model includes all-atom
positional, TLS and individual B-factor refinements. Current refinement statistics
stand at an Rfree value of 22.4% and an Rwork value of 18.3%, using data to 2.80 Å.
Ramachandran plots indicate 0.12% (4 residues) in the disallowed, 4.06% (135
residues) in the allowed, and 95.82% (3,189 residues) in the favoured regions.
Detailed statistics and other quality control measures can be found in Supplemen-
tary Table 1. Figures were prepared with PyMOL”™ and electrostatic potentials
were calculated with APBS”’. Surface conservation and Rrp6-interacting residues
were calculated using the ConSurf* and AquaProt” servers, respectively.
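
The Ramachandran percentages follow directly from the residue counts given above. The short sketch below recomputes them; it assumes that the three categories together cover every residue that was evaluated, and the variable names are illustrative.

```python
# Residue counts per Ramachandran region, as quoted in the text.
counts = {"disallowed": 4, "allowed": 135, "favoured": 3189}

# Assumption: the three categories account for all evaluated residues.
total = sum(counts.values())

percentages = {region: round(100.0 * n / total, 2) for region, n in counts.items()}

for region, pct in percentages.items():
    print(f"{region}: {pct}% of {total} residues")
```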
An eclipsing-binary distance to the Large Magellanic
Cloud accurate to two per cent 


G. Pietrzynski, D. Graczyk, W. Gieren, I. B. Thompson, B. Pilecki, A. Udalski, I. Soszynski, S. Kozlowski, P. Konorski,
K. Suchomska, G. Bono, P. G. Prada Moroni, S. Villanova, N. Nardetto, F. Bresolin, R. P. Kudritzki, J. Storm,
A. Gallenne, R. Smolec, D. Minniti, M. Kubiak, M. K. Szymanski, R. Poleski, L. Wyrzykowski, K. Ulaczyk,
P. Pietrukowicz, M. Gorski & P. Karczmarek


In the era of precision cosmology, it is essential to determine the 
Hubble constant to an accuracy of three per cent or better’. At 
present, its uncertainty is dominated by the uncertainty in the 
distance to the Large Magellanic Cloud (LMC), which, being our 
second-closest galaxy, serves as the best anchor point for the cosmic 
distance scale**. Observations of eclipsing binaries offer a unique 
opportunity to measure stellar parameters and distances precisely 
and accurately**. The eclipsing-binary method was previously 
applied to the LMC®%’, but the accuracy of the distance results 
was lessened by the need to model the bright, early-type systems 
used in those studies. Here we report determinations of the dis- 
tances to eight long-period, late-type eclipsing systems in the LMC, 
composed of cool, giant stars. For these systems, we can accurately 
measure both the linear and the angular sizes of their components 
and avoid the most important problems related to the hot, early- 
type systems. The LMC distance that we derive from these systems 
(49.97 + 0.19 (statistical) + 1.11 (systematic) kiloparsecs) is accur- 
ate to 2.2 per cent and provides a firm base for a 3-per-cent deter- 
mination of the Hubble constant, with prospects for improvement 
to 2 per cent in the future. 
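
The quoted overall accuracy can be reproduced from the two error terms. The sketch below assumes that the statistical and systematic uncertainties are independent and are combined in quadrature; that combination rule is an assumption made here, not something stated in the text, and the variable names are illustrative.

```python
import math

distance_kpc = 49.97   # LMC distance from the text
stat_err_kpc = 0.19    # statistical uncertainty
syst_err_kpc = 1.11    # systematic uncertainty

# Assumption: independent errors, combined in quadrature.
total_err_kpc = math.hypot(stat_err_kpc, syst_err_kpc)
relative_err_percent = 100.0 * total_err_kpc / distance_kpc

print(f"Total uncertainty: {total_err_kpc:.2f} kpc")
print(f"Relative uncertainty: {relative_err_percent:.2f} per cent")
```

Under this assumption the relative uncertainty comes out between 2.2 and 2.3 per cent, in line with the figure quoted above, and it is dominated almost entirely by the systematic term.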

The modelling of early-type eclipsing binary systems consisting of
hot stars is made difficult by the problem of obtaining accurate flux
calibrations for early-type stars and by the degeneracy between the
stellar effective temperatures and reddening. As a result, the distances
determined from such systems are of limited (~5–10%) accuracy.
More-accurate distances can be obtained using binary systems
composed of cool stars; however, among the frequent dwarf stars in the
LMC such systems are too faint to be accurately analysed with
present-day telescopes.

The Optical Gravitational Lensing Experiment (OGLE) has been
monitoring around 35 million stars in the field of the LMC for more
than 16 years. Using this unique data set, we have detected a dozen
extremely scarce, very long-period (60–772-d) eclipsing binary systems
composed of intermediate-mass, late-type giants located in a quiet
evolutionary phase on the helium-burning loop (Supplementary
Table 1). These well-detached systems provide an opportunity to use
the full potential of eclipsing binaries as precise and accurate distance
indicators, and to calibrate the zero point of the cosmic distance scale
with an accuracy of about 2% (refs 5, 12, 13).

To do so, we observed eight of these systems (Fig. 1 shows one
example) over the past 8 yr, collecting high-resolution spectra with the
MIKE spectrograph at the 6.5-m Magellan Clay telescope at the Las
Campanas Observatory and with the HARPS spectrograph attached
to the 3.6-m telescope of the European Southern Observatory on La
Silla, together with near-infrared photometry obtained with the 3.5-m
New Technology Telescope located on La Silla.

The spectroscopic and OGLE V- and I-band photometric observations
of the binary systems were then analysed using the 2007 version
of the standard Wilson–Devinney code, in the same way as in our
recent work on a similar system in the Small Magellanic Cloud.
Realistic errors in the derived parameters of our systems were obtained
from extensive Monte Carlo simulations (Fig. 2). The astrophysical
parameters of all the observed eclipsing binaries were determined with
an accuracy of a few per cent (Supplementary Tables 2–9).

For late-type stars, we can use the very accurately calibrated (2%)
relation between their surface brightness and V−K colour to determine
their angular sizes from optical (V) and near-infrared (K) photometry.
From this surface-brightness/colour relation (SBCR), we
can derive angular sizes of the components of our binary systems
directly from the definition of the surface brightness. Therefore, the
distance can be measured by combining the angular diameters of the
binary components derived in this way with their corresponding linear
dimensions obtained from the analysis of the spectroscopic and photometric
data. The distances measured with this very simple but accurate
one-step method are presented in Supplementary Table 12. The
statistical errors in the distance determinations were calculated by
adding quadratically the uncertainties in absolute dimensions, V−K
colours, reddening and the adopted reddening law. The reddening
uncertainty contributes very little (0.4%) to the total error. A significant
change in the reddening law (from R_V = 3.1 to 2.7, where R_V is
the ratio of total to selective absorption) causes an almost negligible
contribution, at the level of 0.3%. The accuracy of the V−K colour for
all components of our eight binary systems is better than 0.014 mag
(0.7%). The resulting statistical errors in the distances are very close to
1.5%, and are dominated by the uncertainty in the absolute dimensions.
By calculating a weighted mean from the individual distances to
the eight target eclipsing binary systems, we obtain a mean LMC
distance of 49.88 ± 0.13 kpc.
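
The one-step method described above can be illustrated with a short Python sketch. It assumes the common convention S_V = V0 + 5 log10(θ), with θ in milliarcseconds, for the definition of the surface brightness, and it combines individual distances with an inverse-variance weighted mean. The function names and all input values are hypothetical and are not taken from the paper or its supplementary tables.

```python
import math

R_SUN_M = 6.957e8                 # nominal solar radius in metres
PARSEC_M = 3.0856775814913673e16  # one parsec in metres
RAD_TO_MAS = 180.0 / math.pi * 3600.0 * 1000.0  # radians to milliarcseconds


def angular_diameter_mas(surface_brightness, v0):
    """Angular diameter (mas) from S_V = V0 + 5 log10(theta)."""
    return 10.0 ** (0.2 * (surface_brightness - v0))


def distance_kpc(radius_rsun, theta_mas):
    """Distance at which a star of this linear radius subtends theta."""
    theta_rad = theta_mas / RAD_TO_MAS
    diameter_m = 2.0 * radius_rsun * R_SUN_M
    return diameter_m / theta_rad / PARSEC_M / 1000.0


def weighted_mean(values, sigmas):
    """Inverse-variance weighted mean and its formal uncertainty."""
    weights = [1.0 / s ** 2 for s in sigmas]
    mean = sum(w * v for w, v in zip(weights, values)) / sum(weights)
    return mean, math.sqrt(1.0 / sum(weights))


# Hypothetical inputs for one component (illustrative only).
s_v = 5.0      # V-band surface brightness predicted by an SBCR (mag)
v0 = 16.0      # dereddened V magnitude (mag)
radius = 30.0  # linear radius from the orbital solution (solar radii)

theta = angular_diameter_mas(s_v, v0)
print(f"theta = {theta:.5f} mas, d = {distance_kpc(radius, theta):.1f} kpc")

# Hypothetical per-system distances and 1-sigma errors (kpc).
mean, err = weighted_mean([49.5, 50.3, 49.9], [0.7, 0.8, 0.75])
print(f"weighted mean = {mean:.2f} +/- {err:.2f} kpc")
```

Because the surface brightness fixes the angular size and the orbital solution fixes the linear size, no intermediate luminosity calibration enters the distance.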

Our distance measurement might be affected by the geometry and
depth of the LMC. Fortunately, the geometry of the LMC is simple and
well studied. Because nearly all the eclipsing systems are located very
close to the centre of the LMC and to the line of nodes (Fig. 3), we
fitted the distance to the centre of the LMC disk plane, assuming its
spatial orientation (ref. 18). We obtained an LMC barycentre distance of
49.97 ± 0.19 kpc (Fig. 4), which is nearly identical to the simple
weighted mean value. This shows that the geometrical structure
of the LMC has no significant influence on our present distance
determination.

Figure 1 | Change of the brightness of the binary system OGLE-LMC-ECL-06575
and the orbital motion of its components. a, The main panel shows the
orbital motion of the two binary components in the system OGLE-LMC-ECL-06575.
Filled circles, primary component; open circles, secondary component.
The top panel shows the residuals of the fit (see below): observed radial velocities
(O) minus the computed radial velocities (C). b, The main panel shows the
I-band light curve (1,200 epochs collected over 16 yr) of OGLE-LMC-ECL-06575
together with the solution obtained using the Wilson–Devinney code. P,
orbital period. The top panel shows the residuals of the observed magnitudes
from the computed orbital light curve. All individual radial velocities were
determined by the cross-correlation method using appropriate template spectra
and spectra from the Magellan Inamori Kyocera Echelle (MIKE) and High
Accuracy Radial Velocity Planet Searcher (HARPS) spectrographs, yielding in
all cases velocity accuracies better than 200 m s⁻¹ (error bars smaller than the
plotted symbols). The orbit (mass ratio, systemic velocity, velocity amplitudes,
eccentricity and periastron passage) was fitted with a least-squares method to the
measured velocities. The resulting parameters are presented in Supplementary
Tables 2–9. The spectroscopic orbits, light curves and solutions for the
remaining systems are of similar quality.


The systematic uncertainty in our distance measurement comes
from the calibration of the SBCR and the accuracy of the zero points
in our photometry. The root mean squared scatter in the current SBCR
is 0.03 mag (ref. 13), which translates to an accuracy of 2% in the
respective angular diameters of the component stars. Because the surface
brightness depends only very weakly on metallicity, this effect
contributes to the total error budget at the level of only 0.3% (ref. 9).
Both optical (V) and near-infrared (K) photometric zero points are
accurate to 0.01 mag (0.5%). Combining these contributions quadratically,
we determine a total systematic error of 2.1% in our present
LMC distance determination.
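
The quadratic combination quoted above can be checked with a few lines of Python. This is a minimal sketch that treats the listed contributions as independent; counting the V and K zero points as two separate 0.5% terms is an assumption made here, and counting them once changes the total only in the second decimal place.

```python
import math


def quadrature_sum(percent_errors):
    """Combine independent fractional errors (in per cent) in quadrature."""
    return math.sqrt(sum(e ** 2 for e in percent_errors))


# Contributions quoted in the text, in per cent.
systematic_terms = {
    "SBCR calibration": 2.0,
    "metallicity dependence": 0.3,
    "V zero point": 0.5,
    "K zero point": 0.5,  # assumption: V and K zero points counted separately
}

total_percent = quadrature_sum(systematic_terms.values())
print(f"total systematic error: {total_percent:.1f}%")
```

The SBCR calibration term dominates the sum, which is why the text singles it out as the place where future improvement is expected.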

The LMC contains significant numbers of different stellar distance
indicators, and, being the second-closest galaxy to our own, offers us a
unique opportunity to study these indicators with the utmost precision.
For this reason, this galaxy has an impressive record of several
hundred distance measurements. Unfortunately, almost all LMC
distance determinations are dominated by systematic errors, with each
method having its own sources of uncertainties. This prevents a calculation
of the true LMC distance by simply taking the mean of the
reported distances resulting from different techniques. Our present
LMC distance measurement, of 49.97 ± 0.19 (statistical) ± 1.11 (systematic)
kpc (that is, a true distance modulus of 18.493 ± 0.008 (statistical)
± 0.047 (systematic) mag), agrees well, within the combined
errors, with the most recent determinations of the distance to the
LMC. Our purely empirical method allows us to estimate both
statistical and systematic errors in a very reliable way, which is normally
not the case, particularly in distance determinations relying in part on
theoretical predictions of stellar properties and their dependences on
environment. In particular, our result provides a significant improvement
over previous LMC distance determinations made using observations
of eclipsing binaries. Those studies were based on early-type
systems for which no empirical SBCR is available, and so had to rely on
theoretical models to determine the effective temperature. Our present
determination is based on many (eight) binary systems and does not
resort to model predictions.

Figure 2 | Error estimation of the distance for one of our target binary
systems. The reduced χ² map (goodness of fit) for the system OGLE-LMC-ECL-15260
showing the dependence of the goodness of fit to the V-band and
I-band light curves on the distance modulus of the primary component. This
map was obtained from 110,000 models computed with the Wilson–Devinney
code within a broad range of the primary and secondary radii, R1 and R2, the
orbital inclination, i, the phase shift, φ, the secondary's temperature, T2, and
albedo, A2. In each case the distance, d, was calculated from the V-band
surface-brightness/colour (V−K) relation and translated into distance modulus using
m − M = 5 log(d) − 5, where m and M are the observed and absolute
brightnesses, respectively. The horizontal lines correspond to the
standard-deviation limits of the derived distance modulus, of 18.509 mag (50.33 kpc): 1σ,
2σ and 3σ, from bottom to top.

Figure 3 | Location of the observed eclipsing systems in the LMC. Most of
our eight systems (circles) are located quite close to the geometrical centre of the
LMC and to the line of nodes (line), resulting in very small corrections to the
individual distances for the geometrical extension of this galaxy (in all cases
smaller than the corresponding statistical error in the distance determination).
The effect of the geometrical structure of the LMC on the mean LMC distance
reported here is therefore negligible. The background image has a field of view of
8° × 8° and is taken from the All Sky Automated Survey wide-field sky survey.

Figure 4 | Consistency among the distance determinations for the target
binary systems. Distance offsets, Δd, between our particular eclipsing binary
systems and the best-fitted LMC disk plane, plotted against the angular distance
of the systems from the LMC centre, ρ. The identification of the systems is the
same as in Fig. 3. The error bars correspond to 1σ errors. We assumed the
model of the LMC from ref. 18. We fitted one parameter, the distance to the
centre of the LMC (RA = 5 h 25 min 06 s; dec. = −69° 47′ 00″), using a fixed
spatial orientation of the LMC disk (inclination, i = 28°; position angle of the
nodes, Θ = 128°). The resulting distance to the LMC barycentre is
49.97 ± 0.19 kpc, with a reduced χ² value very close to unity.
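
The conversion between a distance in kiloparsecs and a true distance modulus, m − M = 5 log(d) − 5 with d in parsecs, can be sketched in Python as follows. The error conversion uses first-order propagation, which is an assumption of this sketch rather than a statement of how the quoted uncertainties were derived; the results agree with the quoted values to within rounding.

```python
import math


def distance_modulus(d_kpc):
    """True distance modulus m - M = 5 log10(d / pc) - 5."""
    return 5.0 * math.log10(d_kpc * 1000.0) - 5.0


def modulus_error(d_kpc, sigma_kpc):
    """First-order propagation: sigma_mu = (5 / ln 10) * sigma_d / d."""
    return 5.0 / math.log(10.0) * sigma_kpc / d_kpc


def distance_from_modulus(mu):
    """Invert the relation to recover the distance in kiloparsecs."""
    return 10.0 ** ((mu + 5.0) / 5.0) / 1000.0


mu = distance_modulus(49.97)
stat = modulus_error(49.97, 0.19)
syst = modulus_error(49.97, 1.11)
print(f"mu = {mu:.3f} +/- {stat:.3f} (stat) +/- {syst:.3f} (syst) mag")
```

A 1% change in distance corresponds to about 0.022 mag in distance modulus, which is a convenient rule of thumb when comparing results quoted in the two forms.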

The classical approach to deriving the Hubble constant (H0) consists
of deriving an absolute calibration of the Cepheid period–luminosity
relation (CPLR), which is then used to determine the distances to
nearby galaxies containing type Ia supernovae. These supernovae
are excellent standard candles that extend out to the region of unperturbed
Hubble flow once their peak brightnesses are calibrated this
way, and provide the most accurate determination of H0 (ref. 24). An
alternative to calibrating the CPLR with Cepheids in the LMC
is to calibrate it in our own Milky Way galaxy using Hubble Space
Telescope (HST) parallax measurements of the Cepheids nearest to the
Sun. However, the CPLR that results from that approach is less
accurate for two reasons. First, the Cepheid sample with HST parallaxes
is very small (ten stars) relative to the Cepheid sample in the LMC
(2,000 stars), which can be used to establish the CPLR once the LMC
distance is known. Second, the average accuracy of the HST Cepheid
parallaxes is 8% (ref. 25) and suffers from systematic uncertainties that
are not completely understood, including Lutz–Kelker bias.
Therefore, at present the preferred route to determining the Hubble
constant is that which uses the highly abundant LMC Cepheid
population whose mean distance is now known, with the result of this
work, to 2.2%. This result reduces the uncertainty in H0 to a very firmly
established 3%.

We have good reason to believe that there is significant room to
improve on the high-accuracy LMC distance determination reported
here, by improving the calibration of the SBCR for late-type stars,
which is the dominant source of systematic error in our present determination.
This work is in progress, and a determination of the distance
to the LMC accurate to 1% should be possible once the SBCR calibration
is refined. This will have a corresponding effect on further
improving the accuracy of H0. Such an accuracy would be similar to that
of the geometrical distance to the LMC, which is to be delivered by the Gaia
mission in around 12 years from now. The eclipsing-binary technique
will then probably provide the best opportunity to check on the future
Gaia measurements for possible systematic errors.

Porous materials with optimal adsorption
thermodynamics and kinetics for CO2 separation

Shengqian Ma, Brian Space, Lukasz Wojtas, Mohamed Eddaoudi & Michael J. Zaworotko


The energy costs associated with the separation and purification of
industrial commodities, such as gases, fine chemicals and fresh
water, currently represent around 15 per cent of global energy production,
and the demand for such commodities is projected to
triple by 2050 (ref. 1). The challenge of developing effective separation
and purification technologies that have much smaller
energy footprints is greater for carbon dioxide (CO2) than for other
gases; in addition to its involvement in climate change, CO2 is an
impurity in natural gas, biogas (natural gas produced from biomass),
syngas (CO/H2, the main source of hydrogen in refineries)
and many other gas streams. In the context of porous crystalline
materials that can exploit both equilibrium and kinetic selectivity,
size selectivity and targeted molecular recognition are attractive
characteristics for CO2 separation and capture, as exemplified
by zeolites 5A and 13X (ref. 2), as well as metal-organic materials
(MOMs). Here we report that a crystal engineering or reticular
chemistry strategy that controls pore functionality and size in a
series of MOMs with coordinately saturated metal centres and
periodically arrayed hexafluorosilicate (SiF6²⁻) anions enables a
‘sweet spot’ of kinetics and thermodynamics that offers high volumetric
uptake at low CO2 partial pressure (less than 0.15 bar). Most
importantly, such MOMs offer an unprecedented CO2 sorption
selectivity over N2, H2 and CH4, even in the presence of moisture.
These MOMs are therefore relevant to CO2 separation in the
context of post-combustion (flue gas, CO2/N2), pre-combustion
(shifted synthesis gas stream, CO2/H2) and natural gas upgrading
(natural gas clean-up, CO2/CH4).

Porous materials with unsaturated metal centres (UMCs) or
organic amines that chemically interact with CO2 enhance selectivity
for CO2 in the presence of other gases. However, there are drawbacks:
high energy costs associated with activation, regeneration and recycling
of the sorbent material, especially for amines; competition with
water vapour, especially for UMCs; and selectivity tends to monotonically
decrease with increased loading of sorbate. Consequently,
there remains a need for sorbents with favourable CO2 sorption kinetics
and thermodynamics over a wide range of CO2 loading that would
permit efficient CO2 capture with low regeneration costs. MOMs are
attractive in this context because they are inherently modular, in that
they consist of metals or metal clusters (‘nodes’ or ‘molecular building
blocks’) coordinated to multi-functional organic ligands (‘linkers’),
and because they offer extra-large surface areas, up to 7,000 m² g⁻¹ (ref. 6).
However, although extra-large surface area facilitates high gravimetric
uptake of gases at low temperature and/or high pressure, it is not
necessarily conducive to efficient separations under practical conditions.
Here we address how to optimize the thermodynamics and
kinetics of gas adsorption through a class of MOMs that is amenable
to crystal engineering or isoreticular chemistry in a manner that facilitates
precise control over pore size and functionality: namely, ‘pillared
square grids’, two-dimensional nets based on linked metal nodes
that are pillared via SiF6²⁻ anions (‘SIFSIX’) in the third dimension to
form three-dimensional nets with primitive cubic topology.

[Cu(4,4′-bipyridine)2(SiF6)]n, a prototypal primitive-cubic net that
remains one of the best sorbents for CH4 as measured by volumetric
uptake, exhibits highly selective CO2 uptake versus both CH4 and N2
at 1 bar and 298 K (ref. 15). In the absence of UMCs or amine groups,
we attributed this behaviour to favourable interactions between CO2
and SIFSIX. This compound, which we call here SIFSIX-1-Cu, exhibits
one-dimensional square channels (pore size 9.54 Å; here and throughout
this Letter, pore sizes are given as diagonal dimensions) aligned by
a periodic array of SIFSIX pillars, and is prototypical for a class of
compounds that is amenable to pore-size tuning. In this Letter we
report the synthesis, structure and sorption properties of three variants
of SIFSIX-1-Cu with expanded and contracted pore sizes.

Reaction of 4,4′-dipyridylacetylene, dpa (ref. 16), with CuSiF6
afforded purple rod-shaped crystals of [Cu(dpa)2(SiF6)]n, which we
refer to as SIFSIX-2-Cu (see Supplementary Information for synthetic
and crystallographic details for this and other compounds reported
here). SIFSIX-2-Cu forms the expected primitive-cubic net with
square channels of pore dimensions 13.05 Å (Fig. 1a). The interpenetrated
polymorph, SIFSIX-2-Cu-i (Fig. 1b), is composed of doubly
interpenetrated nets that are isostructural to the nets in SIFSIX-2-Cu.
The independent nets are staggered with respect to one another,
affording 5.15 Å pores (Fig. 1b). The isoreticular MOM based on
pyrazine (pyr), SIFSIX-3-Zn, [Zn(pyr)2(SiF6)]n, was prepared according
to published procedures and is also a primitive-cubic net which
encloses 3.84 Å channels (Fig. 1c). Pore sizes in this series therefore
range from ultra-microporous to nanoporous. Bulk purity was confirmed
using powder X-ray diffraction (PXRD) patterns (Supplementary
Figs 1–3).

Activation of SIFSIX-2-Cu and SIFSIX-2-Cu-i (evacuation at 298 K
for 12 h) afforded BET apparent surface areas of 3,140 and 735 m² g⁻¹,
respectively (corresponding Langmuir values, 3,370 and 821 m² g⁻¹),
from N2 adsorption isotherms at 77 K. Micropore volumes are in
good agreement with corresponding theoretical values (Supplementary
Fig. 4 and Supplementary Table 3). SIFSIX-3-Zn adsorbs minimal
amounts of N2 at 77 K and thus its surface area (250 m² g⁻¹) was
determined from the CO2 isotherm collected at 298 K (ref. 18).

Low-pressure CO2, CH4 and N2 sorption data were collected at
298 K (Supplementary Fig. 5a, Table 1). SIFSIX-2-Cu exhibited CO2
uptake of 41.4 cm³ g⁻¹ (equivalent to 1.84 mmol g⁻¹ or 81.3 mg g⁻¹)
at 298 K and 1 bar, but its denser polymorph, SIFSIX-2-Cu-i, exhibited
substantially higher values of 121.2 cm³ g⁻¹ (5.41 mmol g⁻¹, 238 mg g⁻¹).
Such behaviour has also been observed in the context of hydrogen
adsorption. A review of the literature reveals that the gravimetric
CO2 uptake of SIFSIX-2-Cu-i at 298 K and 1 bar is among the highest
yet reported in the context of MOMs (for example, Mg-dobdc (ref. 10),
Co-dobdc (ref. 10), MIL-101 (ref. 20), [Cu(Me-4py-trz-ia)] (ref. 21) and
partially hydrated HKUST-1 (ref. 22)). Notably, the above-mentioned


Figure 1 | The variable pore size channel structures of SIFSIX-2-Cu,
SIFSIX-2-Cu-i and SIFSIX-3-Zn. a, SIFSIX-2-Cu; pore size 13.05 Å, BET
apparent surface area (N2 adsorption) 3,140 m² g⁻¹. b, SIFSIX-2-Cu-i; pore size
5.15 Å, BET apparent surface area (N2 adsorption) 735 m² g⁻¹. c, SIFSIX-3-Zn;
pore size 3.84 Å, apparent surface area (determined from CO2 adsorption


benchmark MOMs possess higher surface area, are less dense than
SIFSIX-2-Cu-i and contain UMCs. Volumetric CO2 uptake of SIFSIX-2-Cu-i
at atmospheric pressure approaches that of Mg-dobdc (151
versus 163 v/v). Ideal adsorbed solution theory (IAST) calculations
indicate binary gas adsorption selectivity (Supplementary Fig. 5b)
under practically relevant conditions (298 K; CH4 and N2 mole fractions
equal to 0.5 and 0.9, respectively) to be dramatically higher for
SIFSIX-2-Cu-i than SIFSIX-2-Cu for both CO2/CH4 (33 versus 5.3) and
CO2/N2 (140 versus 13.7). These findings agree with the CO2/CH4 (51)
and CO2/N2 (72) adsorption selectivity determined experimentally for
SIFSIX-2-Cu-i using column breakthrough tests, a technique that determines
the necessary time for a given volume of a gas to pass through a
given sorbent bed column (Supplementary Fig. 6). To the best of our
knowledge, the CO2/CH4 and CO2/N2 IAST selectivities exhibited by
SIFSIX-2-Cu-i are the highest yet reported for a MOM without UMCs
or amino groups. We attribute these observations to the enhanced
isosteric heat of adsorption (Qst) of SIFSIX-2-Cu-i versus SIFSIX-2-Cu
(45% higher at minimum loading, 71.5% greater at 2.8 mmol g⁻¹,


Table 1 | Gas adsorption/CO2 selectivity results and comparisons




isotherm) 250 m² g⁻¹. Colour code: C (grey), N (blue), Si (yellow), F (light
blue), H (white). All guest molecules are omitted for clarity. Note that
the green net represents the interpenetrated net in SIFSIX-2-Cu-i. The
nitrogen-containing linker present in SIFSIX-2-Cu and SIFSIX-2-Cu-i is
4,4′-dipyridylacetylene (dpa) whereas that in SIFSIX-3-Zn is pyrazine (pyr).


Supplementary Fig. 5c). This increase is presumably attributable to
better overlap of attractive potential fields of opposite walls in the
relatively narrower pores of SIFSIX-2-Cu-i. SIFSIX-2-Cu-i is particularly
suitable for CO2 separation from syngas, as shown by its selectivity
(240) for CO2 over H2 in a CO2/H2:30/70 mixture, and as determined
from column breakthrough experiments (Supplementary Fig. 6c).
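
The three equivalent uptake units quoted for SIFSIX-2-Cu and SIFSIX-2-Cu-i (cm³ g⁻¹, mmol g⁻¹ and mg g⁻¹) are related by the molar volume of an ideal gas and the molar mass of CO2. The following Python sketch assumes that the volumetric figures refer to gas at standard temperature and pressure (273.15 K, 1 atm), which is the usual convention for adsorption data but is not stated explicitly in the text; the function names are illustrative.

```python
MOLAR_VOLUME_STP_CM3_PER_MMOL = 22.414  # ideal gas at 273.15 K and 1 atm
CO2_MOLAR_MASS_MG_PER_MMOL = 44.01


def stp_volume_to_mmol(cm3_per_g):
    """Convert an uptake in cm^3 (STP) per gram to mmol per gram."""
    return cm3_per_g / MOLAR_VOLUME_STP_CM3_PER_MMOL


def mmol_to_mg(mmol_per_g, molar_mass=CO2_MOLAR_MASS_MG_PER_MMOL):
    """Convert an uptake in mmol per gram to mg per gram."""
    return mmol_per_g * molar_mass


# CO2 uptakes at 298 K and 1 bar quoted in the text (cm^3 per gram).
for name, uptake_cm3 in [("SIFSIX-2-Cu", 41.4), ("SIFSIX-2-Cu-i", 121.2)]:
    mmol = stp_volume_to_mmol(uptake_cm3)
    mg = mmol_to_mg(mmol)
    print(f"{name}: {mmol:.2f} mmol/g, {mg:.1f} mg/g")
```

The converted values agree with the quoted mmol g⁻¹ and mg g⁻¹ figures to within rounding of the inputs.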
The heart of pressure- and temperature-swing adsorption (PSA and
TSA) processes for CO2 removal is the adsorbent bed: a recent study
projected that a CO2/N2 selectivity of >500 combined with a capacity
of 2–4 mmol g⁻¹ for a CO2/N2:10/90 mixture would be required for
practical utility (Supplementary Fig. 7). Figure 2a and b presents
the CO2 adsorption isotherms of SIFSIX-2-Cu-i and SIFSIX-3-Zn,
respectively, collected at sub-atmospheric pressures after activation
at 298 K. Contraction of the pores led to a sharp increase in CO2 uptake
at low CO2 loading, with nearly 11 wt% at 0.1 bar for SIFSIX-3-Zn
versus 4.4 wt% at 0.1 bar for SIFSIX-2-Cu-i. Notably, the CO2 uptake
for SIFSIX-3-Zn reached saturation at relatively low pressures
(~0.3 bar; Supplementary Fig. 8), whereas the isotherm for CO2


Compounds
Property                                                          SIFSIX-2-Cu  SIFSIX-2-Cu-i  SIFSIX-3-Zn      Mg-dobdc*  13X†
Pore size (Å)                                                     13.05        5.15           3.84             10.8       10

Single-gas data
Qst for CO2 at low CO2 loading (kJ mol⁻¹)                         22           31.9           45               47–52      44–54
CO2 uptake at 298 K at 0.1 bar/1 bar (mg g⁻¹)                     10/81.3      76/238         105/112          220/352    106/220
CO2 uptake at 298 K at 0.1 bar/1 bar (cm³ cm⁻³)                   3/26         48/151         84/90            101/162    61/126
CH4 uptake at 298 K at 1 bar (mg g⁻¹)                             6.2          7.5            12.6             178        4.2
N2 uptake at 298 K at 1 bar (mg g⁻¹)                              49           4.2            6.4              NA         6.4
H2 uptake at 298 K at 1 bar (mg g⁻¹)                              NM           0.2            1.37             NA         NA

Mixed-gas data
CO2 uptake at 298 K in CO2/N2:10/90 mixture at 1 bar (mg g⁻¹)     8.4          70‡/55§        99.9‡/104.4∥     NA         NA
CO2 uptake at 298 K in CO2/CH4:50/50 mixture at 1 bar (mg g⁻¹)    42.8‡        183‡/138§      108‡/110¶        NA         NA
CO2 uptake at 298 K in CO2/H2:30/70 mixture at 1 bar (mg g⁻¹)     NM           85§            112∥             NA         NA
Selectivity at 1 bar CO2/N2                                       13.7‡        140‡/72§       1,818‡/1,700∥    NA         420‡
Selectivity at 1 bar CO2/CH4                                      5.3‡         33‡/51§        231‡/350¶        137‡       103‡
Selectivity at 1 bar CO2/H2                                       NM           240§           >1,800           800*       NA

NA, not available; NM, not measured.
* Ref. 29 (313 K data).
† Ref. 26 (298 K data).
‡ IAST.
§ Breakthrough experiments.
∥ Mixture gravimetric (G) experiment.
¶ Mixture gravimetric-densimetric gas analysis (GDGA) experiment.

Figure 2 | Pure gas adsorption, modelling and gas mixture breakthrough
studies of SIFSIX compounds. a, b, Variable temperature CO2 sorption
isotherms for SIFSIX-2-Cu-i (a) and SIFSIX-3-Zn (b). c, Qst of CO2 adsorption
on SIFSIX-2-Cu-i and SIFSIX-3-Zn in the low pressure region. d, The modelled
structure of a 3 × 3 × 3 box of unit cells of SIFSIX-3-Zn reveals close


adsorption on SIFSIX-2-Cu-i reached a plateau at relatively higher
pressures (5–7 bar) (Supplementary Fig. 9b). As a result, SIFSIX-3-Zn
exhibits high volumetric CO2 uptake that is comparable to those of
Mg-dobdc (ref. 10) and UTSA-16 (ref. 25) at a CO2 partial pressure typical
for post-combustion CO2 capture (Supplementary Fig. 23).

Figure 2c presents the Qst of CO2 adsorption for SIFSIX-2-Cu-i
and SIFSIX-3-Zn from variable temperature isotherms (Fig. 2a, b),
and the Qst of up to 45 kJ mol⁻¹ is consistent with the steepness of
the CO2 isotherms. The relatively constant Qst indicates homogeneous
binding sites over the full range of CO2 loading (Fig. 2c). These
Qst values are in the ‘sweet spot’ favourable for efficient, reversible
adsorption-desorption (that is, strong but still reversible) and are
supported by modelling studies (Fig. 2d, Supplementary Figs 25–27).
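
One common way to obtain an isosteric heat from isotherms measured at two temperatures is the two-point Clausius–Clapeyron relation, Qst = R T1 T2 / (T2 − T1) × ln(p2/p1), where p1 and p2 are the pressures that give the same loading at T1 and T2. The Python sketch below illustrates that relation only; it is not necessarily the fitting procedure used for Fig. 2c, and the isotherm point used is hypothetical.

```python
import math

R_GAS = 8.314462618  # molar gas constant, J mol^-1 K^-1


def isosteric_heat(p1, t1, p2, t2):
    """Isosteric heat (kJ/mol) from two pressures giving the same loading.

    Two-point Clausius-Clapeyron form:
        Qst = R * T1 * T2 / (T2 - T1) * ln(p2 / p1)
    """
    return R_GAS * t1 * t2 / (t2 - t1) * math.log(p2 / p1) / 1000.0


def pressure_at_t2(p1, t1, t2, qst_kj):
    """Pressure needed at T2 to hold the loading reached at (p1, T1)."""
    return p1 * math.exp(qst_kj * 1000.0 / R_GAS * (1.0 / t1 - 1.0 / t2))


# Hypothetical isotherm point, not measured data: a loading reached at
# 0.010 bar and 298 K for a sorbent with Qst = 45 kJ/mol.
p_298 = 0.010
p_318 = pressure_at_t2(p_298, 298.0, 318.0, 45.0)
qst = isosteric_heat(p_298, 298.0, p_318, 318.0)
print(f"p(318 K) = {p_318:.4f} bar, recovered Qst = {qst:.1f} kJ/mol")
```

With a heat of this size, a 20 K temperature rise roughly triples the pressure needed to hold the same loading, which is the sense in which such binding remains reversible.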

The CO2 selectivity of SIFSIX-3-Zn was investigated via column
breakthrough tests using binary CO2/N2:10/90 (Fig. 2e) and
CO2/CH4:50/50 gas mixtures (Fig. 2f) at 298 K and atmospheric pressure,
and compared to the corresponding breakthrough tests on SIFSIX-2-Cu-i.
Remarkably, SIFSIX-3-Zn showed much higher selectivity (495
and 109 for CO2/N2:10/90 and CO2/CH4:50/50, respectively) than
SIFSIX-2-Cu-i, as CO2 was retained for longer times (for example,
~2,000 s versus 300 s for CO2/N2). Notably, N2 and CH4 breakthrough
occurred within seconds, indicative of high selectivity toward CO2. To
support and confirm the high selectivity derived from the breakthrough
experiments, single-gas (CO2, N2, CH4 and H2) sorption isotherms
were measured at low and high pressures and IAST calculations
were used to predict adsorption equilibria for the following binary
mixtures: CO2/CH4:05/95, CO2/CH4:50/50, CO2/N2:10/90 and
CO2/H2:30/70. These mixtures mimic natural gas upgrading, biogas treatment,
and post- and pre-combustion capture applications, respectively.
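
An IAST prediction of this kind can be sketched in Python for the simplest case, in which each pure-gas isotherm is described by a single-site Langmuir model. That isotherm model, the bisection solver, the function names and all parameter values below are assumptions made for illustration; they are not the fits used in this Letter.

```python
import math


def langmuir_loading(p, q_sat, b):
    """Single-site Langmuir loading at pressure p."""
    return q_sat * b * p / (1.0 + b * p)


def pure_pressure(psi, q_sat, b):
    """Pure-gas pressure at which a Langmuir gas reaches the reduced
    spreading pressure psi = q_sat * ln(1 + b * p)."""
    return math.expm1(psi / q_sat) / b


def iast_binary(p_total, y1, gas1, gas2, tol=1e-12):
    """Solve IAST for a binary mixture; gas1 and gas2 are (q_sat, b).

    Returns (x1, selectivity, q1, q2) for gas-phase mole fraction y1.
    """
    y2 = 1.0 - y1

    def residual(psi):
        # Adsorbed-phase mole fractions must sum to one at the right psi.
        return (p_total * y1 / pure_pressure(psi, *gas1)
                + p_total * y2 / pure_pressure(psi, *gas2) - 1.0)

    lo, hi = 1e-12, 1.0
    while residual(hi) > 0.0:  # residual falls monotonically with psi
        hi *= 2.0
    while hi - lo > tol * hi:  # plain bisection on psi
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    psi = 0.5 * (lo + hi)

    p0_1 = pure_pressure(psi, *gas1)
    p0_2 = pure_pressure(psi, *gas2)
    x1 = p_total * y1 / p0_1
    x2 = p_total * y2 / p0_2
    q_total = 1.0 / (x1 / langmuir_loading(p0_1, *gas1)
                     + x2 / langmuir_loading(p0_2, *gas2))
    selectivity = (x1 / x2) / (y1 / y2)
    return x1, selectivity, x1 * q_total, x2 * q_total


# Hypothetical Langmuir parameters (q_sat in mmol/g, b in 1/bar).
strong_gas = (2.5, 150.0)
weak_gas = (3.0, 0.08)
x1, s, q1, q2 = iast_binary(1.0, 0.10, strong_gas, weak_gas)
print(f"x1 = {x1:.4f}, selectivity = {s:.0f}, q1 = {q1:.2f} mmol/g")
```

When both gases share the same saturation capacity, the result reduces to the ratio of the two Langmuir affinity constants, which is a useful sanity check for any IAST implementation.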

Figure 3a and b shows that the CO2 adsorption selectivity of
SIFSIX-3-Zn calculated for binary gas separation versus CH4 and N2
interactions between the electropositive carbon atoms of CO2 molecules and
fluorine atoms of SIFSIX anions. Colour code: C (grey), H (white), N (blue), O
(red), Si (yellow), F (green), Zn (purple). e, Column breakthrough experiment
for a CO2/N2:10/90 gas mixture (298 K, 1 bar) carried out on SIFSIX-2-Cu-i
and SIFSIX-3-Zn. f, As e but for a CO2/CH4:50/50 gas mixture (298 K, 1 bar).


is unprecedented, outperforming Mg-dobdc (ref. 10), UTSA-16
(ref. 25) and zeolite 13X (ref. 26). Indeed, the selectivity of SIFSIX-3-Zn
is comparable to that of amine-functionalized MOFs and amine-bearing
mesoporous silica, particularly at low CO2 partial pressure.
The calculated selectivity for CO2/N2 (that is, 1,539 ± 307 at 1 bar and
298 K) was validated by gas mixture gravimetric adsorption experiments
at various pressures (Fig. 3c, d).

With regard to CO2/H2 mixtures, adsorption isotherms of CO2/H2:30/70
were collected and showed similar shapes and uptakes to those
obtained using pure CO2 (Supplementary Fig. 10). This indicates that
SIFSIX-3-Zn adsorbs CO2 with very large selectivity over H2 (higher
than 1,800), making it potentially suitable for pre-combustion capture
or H2 purification. Because of the large error associated with H2
adsorption measurement (due to the relatively low uptake), quantitative
measurements of CO2/H2 selectivity were not possible. We note
that calculated and measured selectivities exceeding 1,000 are often
subject to uncertainties associated with measurement of the uptake of
weakly adsorbed gases. Therefore, it would be inappropriate in this
case to make quantitative comparisons between different adsorbents
such as SIFSIX-3-Zn and Mg-dobdc (800 at 1 bar and 313 K; ref. 29).

To confirm the synergistic nature of the thermodynamics and kine- 
tics for CO, capture, competitive adsorption kinetic studies of the 
above gas mixtures were conducted and are presented in Fig. 3e. We 
note that the CO, non-equilibrium uptake at equal times for CO2/N>, 
CO,/CH, and CO,/H), mixtures follows the behaviour of pure CO2. In 
addition, at equilibrium the total CO, uptake from the CO,-containing 
gas mixtures agrees perfectly with the equilibrium uptake for pure 
CO). These distinctive findings show that when CO,-containing mix- 
tures are in contact with SIFSIX-3-Zn, CO, adsorbs more strongly 
and faster than N>, O,, CH, and H,, thus occupying all the available 
space and sorption sites and consequently excluding other gases. Most 
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Figure 3 | Gas mixture selectivity of SIFSIX compounds and the stability 
study of SIFSIX-2-Cu-i. a, Calculated (using IAST) CO, adsorption selectivity 
for two different CO,/CH, mixtures on SIFSIX-2-Cu-i and SIFSIX-3-Zn 
compared to Mg-dobdc and 13X zeolite at 298 K. Experimental data using 
gravimetric-densimetric gas analysis (GDGA) are provided for comparison. 
b, IAST CO,/CH,:50:50 adsorption isotherm prediction compared to 
experimental pure CO2, CH4 and CO3/CH4:50:50 gas mixture adsorption 
isotherms collected using gravimetric (G) adsorption experiments for SIFSIX- 
3-Zn at 298 K. c, CO adsorption selectivity of SIFSIX-2-Cu-i, SIFSIX-3-Zn and 


importantly, SIFSIX-3-Zn fulfils the demanding attributes (Supplemen- 
tary Fig. 7) required for economical and efficient CO post-combustion 
separation. Further, increasing the adsorption temperature did not 
significantly reduce the steepness of the CO, adsorption isotherm 
for SIFSIX-3-Zn (Fig. 2b, Supplementary Fig. 8); this is a desirable 
feature in many CO, separation and purification applications. 

Whereas the sorbents reported here exhibit very good performance 
with respect to CO2 selectivity, their amenability to recycling and 
efficacy in the presence of moisture must also be addressed. The former 
was validated via adsorption-desorption cycle experiments conducted 
at 323 K and 0.15 bar (Supplementary Fig. 14). The latter (specifically, 
the effect of water vapour on the CO2 capacity and selectivity of 
SIFSIX-2-Cu-i and SIFSIX-3-Zn) was evaluated via a series of adsorption 
measurements. The water vapour adsorption isotherms are found 
to be of type I, with uptakes of 20 wt% and 11 wt%, respectively, at 74% 
relative humidity (Supplementary Fig. 24). Water adsorption affinity/capacity 
is reduced in the presence of CO2 gas mixtures, as shown by 
breakthrough experiments, especially for SIFSIX-3-Zn (Supplementary 
Figs 15b and 16b). Importantly, the presence of water in the given 
gas mixture had a negligible effect at elevated CO2 concentrations 
(Supplementary Fig. 15) in the case of SIFSIX-2-Cu-i. Regarding the 
CO2/H2:30/70 mixture, CO2 uptake and selectivity were only slightly 
reduced in the presence of moisture (1.61 mmol g⁻¹ and 191 at 74% 
relative humidity versus 1.99 mmol g⁻¹ and 237 at 0% relative 
humidity for SIFSIX-2-Cu-i, Supplementary Fig. 15b). Whereas SIFSIX-2-Cu-i 
was structurally unchanged by exposure to moisture (Fig. 3f), 
SIFSIX-3-Zn undergoes a reversible phase change at relatively high 
humidity (Supplementary Figs 19-22). 
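
To put "only slightly reduced" into numbers, the following Python sketch computes the fraction of dry-gas performance retained at 74% relative humidity. It uses only the four values quoted above for SIFSIX-2-Cu-i; the dictionary keys and the function name are illustrative choices, not part of the original work.

```python
# Values quoted in the text for SIFSIX-2-Cu-i with the CO2/H2:30/70 mixture.
DRY = {"uptake_mmol_per_g": 1.99, "selectivity": 237}    # 0% relative humidity
HUMID = {"uptake_mmol_per_g": 1.61, "selectivity": 191}  # 74% relative humidity


def retained_fraction(dry, humid):
    """Return humid/dry for every quantity present in the dry-gas record."""
    return {key: humid[key] / dry[key] for key in dry}


RETAINED = retained_fraction(DRY, HUMID)

if __name__ == "__main__":
    for name, fraction in RETAINED.items():
        print(f"{name}: {100 * fraction:.0f}% of the dry-gas value retained")
```

Both ratios come out at roughly 0.8, so about four-fifths of the dry-gas uptake and selectivity are retained under humid conditions.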

We have demonstrated how a crystal engineering or reticular 
chemistry approach to pore size control, coupled with favourable 


13X zeolite for CO2/N2:10:90 as calculated using IAST at 298 K. d, IAST CO2/ 
N2:10:90 adsorption isotherm predictions compared to experimental pure CO2, 
N2 and CO2/N2:10:90 gas mixture adsorption isotherms collected using 
gravimetric (G) adsorption experiments for SIFSIX-3-Zn at 298 K. e, Kinetics of 
adsorption of SIFSIX-3-Zn for pure gases and gas mixtures containing various 
compositions of CO2. f, PXRD patterns of SIFSIX-2-Cu-i after multiple cycles 
of breakthrough tests, high-pressure sorption, and water sorption experiments 
(compared to the calculated pattern). 


electrostatic interactions provided by an array of inorganic anions, 
affords porous materials with exceptional selectivity, recyclability 
and moisture stability in the context of several industrially relevant 
CO2 separation applications. The structural features and exceptional 
mixed-gas sorption properties of the SIFSIX compounds reported here 
show that it is now possible to combine equilibrium and kinetic 
adsorption selectivity in the same porous material to facilitate effective 
CO2 separation and capture. 

Low-temperature aqueous-phase methanol 
dehydrogenation to hydrogen and carbon dioxide 


Martin Nielsen, Elisabetta Alberico, Wolfgang Baumann, Hans-Joachim Drexler, Henrik Junge, Serafino Gladiali 
& Matthias Beller 


Hydrogen produced from renewable resources is a promising 
potential source of clean energy. With the help of low-temperature 
proton-exchange membrane fuel cells, molecular hydrogen can be 
converted efficiently to produce electricity. The implementation 
of sustainable hydrogen production and subsequent hydrogen 
conversion to energy is called the “hydrogen economy”. Unfortunately, its 
physical properties make the transport and handling of hydrogen 
gas difficult. To overcome this, methanol can be used as a material 
for the storage of hydrogen, because it is a liquid at room 
temperature and contains 12.6 per cent hydrogen by mass. However, the 
state-of-the-art method for the production of hydrogen from methanol 
(methanol reforming) is conducted at high temperatures (over 
200 degrees Celsius) and high pressures (25-50 bar), which limits 
its potential applications. Here we describe an efficient 
low-temperature aqueous-phase methanol dehydrogenation process, 
which is facilitated by ruthenium complexes. Hydrogen generation 
by this method proceeds at 65-95 degrees Celsius and ambient 
pressure with excellent catalyst turnover frequencies (4,700 per 
hour) and turnover numbers (exceeding 350,000). This would make 
the delivery of hydrogen on mobile devices, and hence the use of 
methanol as a practical hydrogen carrier, feasible. 

Of the known hydrogen storage materials, methanol represents a 
promising energy carrier. Its convenient properties (12.6% hydrogen 
and being liquid at room temperature) suggest that a “methanol 
economy” may be possible. To combine the advantages of simple 
methanol transportation and the energy efficiency of proton-exchange 
membrane fuel cells, so-called ‘reformed methanol fuel cells’ have been 
developed. However, methanol-reforming processes are insufficient 
unless high temperatures (>200 °C) are used. Therefore, the 
state-of-the-art efficiency of reformed methanol fuel cells is only 
approximately 40%. Moreover, only very low concentrations of carbon monoxide 
are tolerated in the fuel cell (<10 parts per million, p.p.m.), which is 
challenging for current catalytic systems. In general, methanol 
reforming occurs in the steam phase using heterogeneous catalysis and high 
temperatures. Notably, Dumesic et al. have developed a Pt/Al2O3-based 
catalyst that allows for aqueous-phase methanol reforming at 
more acceptable temperatures (200-225 °C) and pressure (25-50 bar). 
In addition, the use of less expensive and more abundant Cu/Zn/Al2O3 
catalyst systems is also noteworthy, although they need higher 
temperatures than does the Pt/Al2O3 catalyst. 

Our long-term interest in efficient low-temperature hydrogen 
generation led us to identify active molecular-defined catalysts for a 
low-temperature (<100 °C) aqueous-phase methanol dehydrogenation 
process. We envisioned a one-pot stepwise process including 
initial dehydrogenation of methanol to hydrogen and formaldehyde, 
water-promoted dehydrogenation of formaldehyde to formic acid and 
hydrogen, and final dehydrogenation of formic acid to hydrogen and 
carbon dioxide. This should lead to an overall yield of three 
molecules of hydrogen and one of carbon dioxide, thus resembling classic 
methanol reforming (Fig. 1a). Either methanol (MeOH) or H2O could 
be in excess, depending on the potential applications. 
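
The 12.6% hydrogen content of methanol and the three-for-one stoichiometry of the envisioned sequence can be checked with a few lines of arithmetic. The Python sketch below is illustrative only: the atomic masses are standard values rather than numbers from this work, and the step list and variable names are shorthand introduced here for the sequence described above.

```python
# Standard atomic masses in g/mol (assumed values, not taken from the paper).
M_H, M_C, M_O = 1.008, 12.011, 15.999

M_MEOH = M_C + 4 * M_H + M_O   # CH3OH
M_H2O = 2 * M_H + M_O
M_H2 = 2 * M_H

# Mass fraction of hydrogen atoms contained in methanol itself.
h_content_meoh = 4 * M_H / M_MEOH

# The three dehydrogenation steps of the envisioned sequence, each written
# as (label, molecules of H2 released per molecule of methanol).
STEPS = [
    ("CH3OH -> HCHO + H2", 1),
    ("HCHO + H2O -> HCOOH + H2", 1),
    ("HCOOH -> CO2 + H2", 1),
]
h2_per_meoh = sum(n for _, n in STEPS)

# H2 released per unit mass of a 1:1 (molar) MeOH/H2O feed at full conversion.
h2_yield_feed = h2_per_meoh * M_H2 / (M_MEOH + M_H2O)

if __name__ == "__main__":
    print(f"H content of methanol: {100 * h_content_meoh:.1f} wt%")
    print(f"H2 molecules per methanol over the full sequence: {h2_per_meoh}")
    print(f"H2 per mass of 1:1 MeOH/H2O feed: {100 * h2_yield_feed:.1f} wt%")
```

The first figure reproduces the quoted 12.6%. The last one shows that, because one of the three hydrogen molecules originates from water, the gravimetric yield of the combined feed is of similar size.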

To the best of our knowledge, all known molecular-defined 
organometallic catalysts for alcohol dehydrogenation promote the liberation 
of only one molecule of hydrogen from each alcohol molecule. 
Moreover, there exists no example of homogeneously catalysed 
dehydrogenations of aldehydes in the presence of water. Recently, 
organometallic complexes containing pincer-type ligands (capable of 
enabling cooperative metal-ligand catalysis) showed interesting 
activities in catalytic hydrogenation and dehydrogenation reactions. 
Therefore, we envisioned that this type of catalyst might be suitable for 
selective methanol dehydrogenation, and possibly even for methanol 
reforming. To our delight, testing a variety of ruthenium-based pincer 
complexes (Supplementary Scheme 1 and Supplementary Table 2) in a 
4:1 mixture of MeOH/H2O containing 0.5 M NaOH revealed significant 
catalyst activity for complexes 1a and 2a (the turnover frequency is the 
amount of hydrogen produced by each catalyst molecule per hour: 
120 h⁻¹ and 45 h⁻¹, respectively; Fig. 1b). Interestingly, in these reactions 
hydrogen generation had already occurred at 72.0 °C and all reactions 
produced very pure hydrogen gas. 

Figure 1 | Methanol reforming by homogeneous catalysis. a, Schematic 
pathway for a homogeneously catalysed methanol reforming process via three 
discrete dehydrogenation steps. b, Best-performing catalysts among those 
tested (see Supplementary Information) for aqueous-phase methanol 
dehydrogenation performed with 0.5 M NaOH at 72.0 °C. 

Table 1 | Ru-catalysed aqueous-phase methanol dehydrogenation 

Reaction scheme: MeOH + H2O → H2 + C1 residuals (catalyst 1a or 2a) 

Entry  Catalyst  Amount  Loading   MeOH/H2O   Temperature  Turnover frequency (h⁻¹) 
                 (µmol)  (p.p.m.)  ratio      (°C)         1 h      2 h      3 h 
1      1a        4.18    21        4:1        89.0         754      819      860 
2      1a        4.18    19        9:1        91.0         1,023    1,051    1,093 
3      2a        22.2    100       9:1        91.0         1,281    1,250    1,151 
4      2a        4.18    19        9:1        91.0         2,276    2,228    2,205 
5      2a        1.58    1.8       9:1        91.0         2,668    2,687    2,670 
6      1a        4.18    28        3:2        86.5         90       121      130 
7      2a        4.18    28        3:2        86.5         732      719      [illegible] 
8      2a        1.58    16        1:9        95.0         265      228      201 
9      2a        1.58    1.6       neat MeOH  95.0         4,719    4,716    4,734 

Investigation into the effect of MeOH/H2O ratio on catalyst activity. Performed with catalyst 1a or 2a (amounts are given in µmol, and loading in µmol mol⁻¹ of MeOH is stated as parts per million) in a 10-40 ml 
MeOH/H2O solution containing 8.0 M KOH (molarity based on total MeOH/H2O volume). For entries 1-7, the reaction mixtures were under reflux. The H2/CO2 ratio, determined by gas-phase chromatography, was 
consistently >500:1. The gas evolution was determined by gas burette measurements. Each molecule of hydrogen is counted as one turnover. 


Under catalytic conditions (Supplementary Scheme 5) both formate 
and carbonate ions were observed in the nuclear magnetic resonance 
(NMR) as traces of the reaction mixtures, showing that the formate is 
an intermediate in this dehydrogenation sequence and that CO2 is 
trapped as carbonate (Supplementary Figs 16 and 19). As shown in 
Fig. 1a, the first dehydrogenation step of methanol results in 
formaldehyde, which, however, we never detected in solution. The subsequent 
conversion to formate and hydrogen occurs rapidly under these 
conditions, but formate conversion is apparently more sluggish. Moreover, 
due to the alkaline character of the solution, any evolved CO2 is initially 
trapped as carbonate. Hence, in the first phase of the reaction a 
base-mediated reforming sequence proceeds according to equation (1): 


CH3OH + 2NaOH → 3H2 + Na2CO3    (1) 

Figure 2 | Approaching ‘real’ aqueous methanol reforming. a, Aqueous 
methanol dehydrogenation using 1.0 p.p.m. of catalyst 2a in a 9:1 ratio of 
MeOH/H2O containing 8.0 M KOH. b, Set-up for continuous aqueous 
methanol reforming using 2a. Evolved gas volume (red line) and gas 
composition (blue bars) as a function of time, with conditions as follows: 
MeOH/H2O 4:1 (10 ml), NaOH 0.1 M, 2a 49.3 µmol, 250 p.p.m. with respect to 
MeOH, 72 °C. For details, see Supplementary Table 4. 

Although no bicarbonate (NaHCO3) was detected in the reaction 
mixtures, it should not be conclusively excluded as a possible product 
given the change in pH observed over time (as discussed below). 
Owing to the inherent complexity of the overall methanol reforming 
via dehydrogenation reactions, we assign turnover frequency and 
turnover number (the amount of hydrogen per catalyst molecule) as 
measures of the catalyst activity and productivity, respectively, for the 
production of each H2 molecule. In fact, the successive production of 
three H2 molecules and one CO2 molecule gives a turnover number 
of 3. 

Optimization of the reaction conditions demonstrates that several 
factors affect the catalyst activity: significant influences are seen by the 
nature of the base, its concentration, the water content, and the 
temperature. Exploring a range of bases and different base concentrations 
in a 4:1 MeOH/H2O solution led to the finding that 8.0 M potassium 
hydroxide (KOH) gave optimal results (Table 1, entry 1). It should be 
noted that the temperature increase due to such a high base 
concentration (salt effect) seems to be far more important than the high base 
concentration itself in boosting activity (Supplementary Scheme 2 and 
Supplementary Table 3). Using 8.0 M KOH, complex 2a was more 
active than complex 1a, with a turnover frequency of up to 2,687 h⁻¹ 
(Table 1, entries 2-5). Not surprisingly, increasing the water content 
(MeOH/H2O = 3:2) had a detrimental effect on the catalytic activity. 
The poor solubility of 1a explains the increase in activity over time 
because it is only slowly transformed into the catalytically active species 
(Table 1, entry 6). Again, using 2a instead of 1a resulted in a more than 
fivefold improvement of activity (Table 1, entry 7). Even a highly 
diluted methanol system (MeOH/H2O = 1:9) still afforded significant 
hydrogen evolution, albeit with lower activity (Table 1, entry 8). From 
a practical point of view, this latter result is important because it allows 
the use of an aqueous methanol solution that may vary in methanol 
concentration over time. When performing an experiment simply with 
methanol and potassium hydroxide, a remarkable activity of 
approximately 4,720 h⁻¹ was obtained (Table 1, entry 9). Moreover, when the 
experiment in entry 4 of Table 1 was repeated in the presence of a large 
excess (>500 equivalents) of mercury (Hg), no decrease in activity was observed, 
which supports the homogeneous nature of the catalytic system. 

To prove the stability of the catalyst system, a long-term experiment 
using 0.88 µmol of 2a in 40 ml of a 9:1 MeOH/H2O solution, 
corresponding to a catalyst loading of 1 µmol mol⁻¹ of MeOH (1 p.p.m.), 
was performed (Fig. 2a; Supplementary Scheme 3 and Supplementary 
Figs 13-15). In this case, MeOH is in excess and is thus taking the role 
of both the reactant and the solvent. The molecular catalyst was stable 
for more than three weeks and after 23 days, a total turnover number of 
350,000 was achieved. Even after this time, the catalyst was still active 
with an instant turnover frequency of 200 h⁻¹, measured as an average 
over the last 24 h. In this case, the hydrogen yield calculated for 
full methanol reforming was 27% (12% with respect to MeOH, 
Supplementary Equations (3) and (4)). The reaction showed almost 
constant activity for the last ten days. Importantly, less than 1 p.p.m. of 
both CH4 and CO were observed by gas-phase gas chromatography 
throughout the reaction. By increasing the catalyst loading, the 
performance was easily improved. Hence, the same hydrogen yield is 
reached within 90 min when using 150 µmol mol⁻¹ (150 p.p.m.) of 
2a. Within 24 h, the yield of H2 increases to 59% under these 
conditions and that of CO2 approaches 43% (26% and 19%, respectively, if 
referring solely to MeOH). We did not perform any further 
optimizations towards higher conversion. 
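
The loading and the methanol-based yield quoted for the long-term run can be reproduced approximately from the stated quantities. The sketch below is a rough check and not the authors' calculation, which is given in the Supplementary Equations. It assumes that the catalyst amount is 0.88 µmol (consistent with the 1 p.p.m. loading), that the 9:1 ratio is by volume, and that methanol has a density of 0.792 g ml⁻¹ and a molar mass of 32.04 g mol⁻¹; all function names are hypothetical.

```python
MEOH_DENSITY_G_PER_ML = 0.792   # assumed, near room temperature
MEOH_MOLAR_MASS = 32.04         # g/mol


def methanol_moles(total_volume_ml, meoh_parts, water_parts):
    """Moles of methanol in a MeOH/H2O mixture specified by volume parts."""
    meoh_ml = total_volume_ml * meoh_parts / (meoh_parts + water_parts)
    return meoh_ml * MEOH_DENSITY_G_PER_ML / MEOH_MOLAR_MASS


def loading_ppm(catalyst_mol, meoh_mol):
    """Catalyst loading in micromoles of catalyst per mole of methanol."""
    return 1e6 * catalyst_mol / meoh_mol


def h2_moles_from_ton(turnover_number, catalyst_mol):
    """Each H2 molecule counts as one turnover, so n(H2) = TON * n(catalyst)."""
    return turnover_number * catalyst_mol


def h2_yield_vs_methanol(h2_mol, meoh_mol, h2_per_meoh=3):
    """Fraction of the H2 obtainable if every methanol released three H2."""
    return h2_mol / (h2_per_meoh * meoh_mol)


n_meoh = methanol_moles(40.0, 9, 1)        # 40 ml of 9:1 MeOH/H2O
n_cat = 0.88e-6                            # 0.88 micromol of 2a
n_h2 = h2_moles_from_ton(350_000, n_cat)   # total turnover number after 23 days

if __name__ == "__main__":
    print(f"loading: {loading_ppm(n_cat, n_meoh):.2f} p.p.m.")
    print(f"H2 produced: {n_h2:.3f} mol")
    print(f"yield vs MeOH: {100 * h2_yield_vs_methanol(n_h2, n_meoh):.1f}%")
```

Under these assumptions the loading comes out close to 1 p.p.m. and the yield with respect to MeOH close to the quoted 12%. The 27% figure for full reforming depends on a definition given only in the Supplementary Information and is not reproduced here.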

Figure 3 | Proposed catalytic cycle for Ru-promoted aqueous-phase 
methanol dehydrogenation. Proposed catalyst activation and catalytic cycle 
for low-temperature methanol reforming catalysed by 2c, made in situ from 
2a. The phosphorus iPr substituents have been omitted for clarity. ‡ denotes 
transition state; solvent-H indicates a molecule of solvent involved in hydrogen 
bonding. 

To show that a ‘real’ aqueous methanol-reforming process with a 3:1 
H2/CO2 gaseous content can be achieved, an experiment in the 
presence of low amounts of base (0.1 M of NaOH in a 4:1 MeOH/H2O) was 
performed (Supplementary Scheme 4). Under these conditions, the 
majority of CO2 is released instead of being trapped as carbonate in 
solution. Indeed, in the presence of 250 p.p.m. of 1a, the expected 3:1 
H2/CO2 gas composition was observed after approximately 5-6 h and 
then stayed constant until the reaction was stopped (Fig. 2b). We note 
that a correlation between solution pH and gas composition is 
observed (Supplementary Table 4). The pH, initially at 13, dropped 
to approximately 10 over the first 4h and then stayed constant for the 
rest of the reaction time. It seems that in the first phase of the reaction 
an equilibrium among hydroxide/(bi)carbonate/formate is approached, 
which then leads to a ‘steady state’ gas evolution, as would be expected 
if methanol reforming is taking place. The higher-than-expected 
content of H2 observed even after onset of the equilibrium is due to the fact 
that the hydrogen generated in the pre-equilibrium phase has not been 
discharged. Upon addition of aqueous HCl to a reaction mixture 
having pH 11 (Supplementary Table 4, entry 4), about 0.7 mmol of CO2 
was evolved, proving that NaOH is actually consumed in the initial 
course of the reaction. Similar to the long-term reaction experiment, a 
stable gas evolution (activity) was reached; this time after 2-3 h. 

Careful analysis of Fig. 2b shows that the overall process goes 
through three distinct reaction stages: In the first stage, corresponding 
to the first hour, the process is mediated by the base and a substantial 
amount of free hydroxide is still present in the reaction solution. Here, 
over 90% of the gas is H2 and the pH stays >12. The second stage is the 
equilibrating phase when the gas distribution, hydroxide consumption 
and pH are approaching a steady state. During this period, the activity 
changes at a slower rate while approaching the steady-state value as 
well. In the third stage (after 4-5 h and thereafter), steady-state con- 
ditions are established and the hydroxide/(bi)carbonate/formate con- 
centration, gas evolution and distribution, and pH remain constant. 
Thus, the gas flux contains 3:1 H2/CO2, and pH stays at about 10. 
Considering the observed general stability of the catalytic system 
(Fig. 2a), gas evolution can, under these conditions, last for weeks. 

On the basis of in situ NMR experiments using 2a as the pre-catalyst 
(Supplementary Figs 16-19), we propose the following catalytic cycle 
for our process (Fig. 3). Reaction of 2a with base affords the more active 
species 2c, which is able to dehydrogenate MeOH to yield hydrogen 
and formaldehyde. In agreement with previous observations, we 
suggest that the generation of the first hydrogen molecule occurs through 
an outer-sphere concerted process as in I, that is, without direct 
coordination of methanol to the metal. Because formaldehyde itself as well 
as its hydrated species are barely detected in solution, HCHO might 
undergo a sequence of fast reactions while still coordinated to the metal 
centre. Attack of hydroxide gives the gem-diol(ate) II, which is 
probably further stabilized by a solvent-cage of the protic H2O/MeOH. 
Again, outer-sphere dehydrogenation releases the second hydrogen 
molecule and formate is formed. At this point, the catalyst either 
restarts the catalytic cycle by losing formate and reacting with MeOH or 
it performs the final step of the catalytic cycle via III, leading to CO2 
and the third molecule of H2. Because free formate is actually detected 
in solution, its dehydrogenation through reaction with 2c must be slow 
at low formate concentration in competition with MeOH. However, the 
fact that substantial CO2 formation is observed at a point where MeOH 
is still in large excess suggests that formate dehydrogenation might 
be relatively faster than MeOH dehydrogenation. 

Notably, hydride species are observed in solution under catalytic 
conditions (Supplementary Fig. 17), most probably 2a and 2e-g. It 
seems that the need for base to keep the catalytic system active is related 
to the continuous formation of 2c from such species and not to the 
deprotonation of methanol and formic acid. We may speculate whether 
the alkalinity of the medium might enhance reactivity by 
deprotonation of methanol. However, this would most probably require an 
inner-sphere mechanism, which has been questioned previously. 

In conclusion, we have demonstrated aqueous-phase methanol 
dehydrogenation and reforming at low temperature. Compared to 
previously known heterogeneous systems, we observe low amounts 
of contaminant gases CO and CH4 (<10 p.p.m.). Key to success is 
the use of molecular-defined [RuHCl(CO)(HN(C2H4PiPr2)2)] (where 
iPr = CH(CH3)2) catalysts, which exhibit high activities up to 4,700 h⁻¹ 
and catalyst productivities of >350,000. The optimized system is stable 
under aqueous alkaline conditions and remains active for more than 
three weeks. Once the relative concentrations of the reagents and 
possible intermediates competing for the catalyst have reached suitable 
steady-state values, the ratio of H2 to CO2 expected for MeOH aqueous 
reforming (H2/CO2 3:1) is observed. Moreover, this system represents 
full conversion of all ‘available’ hydrogen atoms in the substrates to 
hydrogen by homogeneous catalysis. An intrinsic drawback is the base 
needed for this reaction to stay active. Nevertheless, using this system 
it might be possible to combine the advantages of methanol as the 
‘hydrogen carrier’ and the superior efficiency of proton-exchange 
membrane fuel cells compared to methanol fuel cells. 


METHODS SUMMARY 


All reactions were performed under inert atmosphere (argon) with exclusion of 
air. A solution of MeOH and H2O in a given ratio, containing a defined amount of 
base, was heated to a certain temperature and left to equilibrate for 30 min. Then an 
amount of one of five catalysts (1-4, and 5 (ref. 30)) was added to the solution. The 
amount of catalyst is reported in parts per million relative to MeOH. Gas evolution 
was measured by a gas burette (100-ml, 500-ml or 1-litre burette). Each molecule 
of hydrogen is counted as resembling one turnover. After the desired reaction 
time, the identity of the gas components and their ratio was determined by gas- 
phase chromatography. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


All reactions were performed under inert atmosphere (argon) with exclusion of 
air. Methanol was refluxed over magnesium and distilled under argon atmosphere. 
Water was distilled and degassed before use. NaOH and KOH were purchased and 
used as received. [RuHCl(CO)(PPh3)3], [RuHCl(CO)(HN(C2H4PPh2)2)] (complex 
1a, Ru-MACHO), [RuH(BH4)(CO)(HN(C2H4PPh2)2)] (complex 1b, 
Ru-MACHO-BH), HN(C2H4PiPr2)2, HN(C2H4PCy2)2 and carbonylchlorohydrido 
[6-(di-t-butylphosphinomethyl)-2-(N,N-diethylaminomethyl)pyridine]ruthenium(II) 
(complex 4; see Supplementary Scheme 1) were purchased from 
STREM Chemicals Inc., stored under argon, and used as received. 
[RuHCl(CO)(HN(C2H4PiPr2)2)] (complex 2a), [Ru(H)2(CO)(HN(C2H4PiPr2)2)] (complex 
2b; Supplementary Figs 6 and 7), [RuH(CO)(N(C2H4PiPr2)2)] (compound 2c; 
Supplementary Figs 8 and 9), [RuHCl(CO)(HN(C2H4PCy2)2)] (compound 3; 
Supplementary Figs 10-12), and complex 5 (ref. 30; see Supplementary Scheme 
1) were prepared following or adapting procedures in the literature. Catalytic 
reactions for the measurement of hydrogen gas evolution were carried out in a 
thermostated double-walled three-necked flask equipped with a high-efficiency 
reflux condenser. The latter was connected via a three-way stopcock to a gas 
burette (see Supplementary Figs 4 and 5). At the beginning of each experiment, 
the reactor system (flask and condenser) was subjected to five consecutive 
vacuum-argon cycles, while isolated from the burette. Then the whole system, 
including burette, was purged with argon for at least 15 min. A defined amount of 
water was then introduced in the flask, followed by the required amount of 
base and methanol. The system was then heated to the desired temperature by 
means of a thermostat. Once the temperature set in the thermostat had been 
reached, the system was allowed to equilibrate for 30 min, after which time a 
homogeneous solution had formed, even in those cases when a very high base 
concentration was used. The inner temperature reported for each experiment 
was measured with a ground-glass joint thermometer connected to the reactor 
(Supplementary Table 1). The desired amount of catalyst was weighed in a Teflon 
crucible and introduced into the reactor. The reaction solution was stirred at 
700 r.p.m. To ascertain the composition of the evolved gas, a sample of the gas 
phase was withdrawn through a septum in the three-way stopcock with a gas-tight 
syringe and analysed by gas-phase chromatography using either HP Plot Q/flame 
ionization detector-hydrocarbons, carboxen/thermoconductivity detector-per- 
manent gases, He carrier gas or carboxen/thermoconductivity detector/methani- 
zer/flame ionization detector-permanent gases, He carrier gas (for representative 
gas-phase gas chromatography traces, see Supplementary Figs 1-3). The second 
gas-phase gas chromatography (see Supplementary Experimental Methods GC b) 
allows the detection of very low levels of CH4 and CO. The turnover number was 
calculated from the amount of hydrogen produced. Every hydrogen molecule 
represents one catalyst turnover (see Supplementary Equations (1) 
and (2)). 


Dynamics of a Snowball Earth ocean 


Yosef Ashkenazy, Hezi Gildor, Martin Losch, Francis A. Macdonald, Daniel P. Schrag & Eli Tziperman 


Geological evidence suggests that marine ice extended to the 
Equator at least twice during the Neoproterozoic era (about 750 
to 635 million years ago), inspiring the Snowball Earth hypothesis 
that the Earth was globally ice-covered. In a possible Snowball 
Earth climate, ocean circulation and mixing processes would have 
set the melting and freezing rates that determine ice thickness, 
would have influenced the survival of photosynthetic life, and 
may provide important constraints for the interpretation of 
geochemical and sedimentological observations. Here we show that 
in a Snowball Earth, the ocean would have been well mixed and 
characterized by a dynamic circulation, with vigorous equatorial 
meridional overturning circulation, zonal equatorial jets, a well 
developed eddy field, strong coastal upwelling and convective 
mixing. This is in contrast to the sluggish ocean often expected in a 
Snowball Earth scenario owing to the insulation of the ocean from 
atmospheric forcing by the thick ice cover. As a result of vigorous 
convective mixing, the ocean temperature, salinity and density were 
either uniform in the vertical direction or weakly stratified in a few 
locations. Our results are based on a model that couples ice flow and 
ocean circulation, and is driven by a weak geothermal heat flux 
under a global ice cover about a kilometre thick. Compared with 
the modern ocean, the Snowball Earth ocean had far larger vertical 
mixing rates, and comparable horizontal mixing by ocean eddies. 
The strong circulation and coastal upwelling resulted in melting 
rates near continents as much as ten times larger than previously 
estimated. Although we cannot resolve the debate over the 
existence of global ice cover, we discuss the implications for the 
nutrient supply of photosynthetic activity and for banded iron 
formations. Our insights and constraints on ocean dynamics may help 
resolve the Snowball Earth controversy when combined with future 
geochemical and geological observations. 

The flow of thick ice over a Snowball Earth ocean (“sea glaciers”, 
characterized by dynamics very different from that of thinner sea 
ice) has received significant attention over the past few years. 
Similarly, the role and dynamics of atmospheric circulation and heat 
transport, CO2 concentration, cloud feedbacks, and continental 
configuration have been studied, as has the role of dust over the 
Snowball Earth’s ice cover. In contrast, despite its importance, 
the ocean circulation during Snowball Earth events has received little 
attention. The few studies that used full-ocean General Circulation 
Models concentrated mostly on the ocean’s role in Snowball Earth 
initiation and aftermath. No studies accounted for the combined 
effects of thick ice cover and flow, and driving by geothermal 
heating, yet ref. 11 simulated an ocean under a 200-m-thick ice 
cover with no geothermal heat flux, and described a non-steady-state 
solution with near-uniform temperature and salinity, and vanishing 
Eulerian circulation together with strong parameterized eddy-induced 
high-latitude circulation cells. 

To allow us to simulate the special circumstances during Snowball 
Earth events, we use a novel model that couples the ocean and thick 
ice flow (see Methods). We begin by exploring the Snowball Earth 
circulation using an easier-to-understand two-dimensional (2D, 
latitude and depth) ocean model with no continents. The results point 
to the importance of ocean eddy motions, and given that these have 
never been studied for a Snowball Earth ocean, we investigate them 
using a high-resolution sector ocean model. Finally, we consider a 
near-global three-dimensional (3D) ocean model with reconstructed 
Neoproterozoic continental geometry to verify that our insights remain 
valid in this more realistic configuration. 

We consider first the zonally averaged temperature, salinity and 
circulation, including the meridional overturning circulation (MOC). 
Figure 1 shows steady-state results from a 2D (latitude and depth) 
version of the ocean model coupled to a one-dimensional (1D, latitude) 
ice flow model, driven by geothermal heating (about 0.1 W m⁻²) that is 
enhanced over a prescribed bottom ridge (Supplementary Fig. 2). We 
assume the presence of land ice at a volume equivalent to about one 
kilometre of sea level (Methods), and our model predicts that the upper 
kilometre of the ocean is frozen as well. Concentrating salt in a 
significantly lower ocean volume resulted in ocean salinity that is significantly 
higher than that of the present day (Fig. 1b). The high salinity and high 
pressure at the bottom of the ice lowered the freezing temperature to 
approximately −3.5 °C, explaining the cold temperature field (Fig. 1a). 
Variations in temperature and salinity are fairly weak, as expected 
given the insulating thick ice cover and weak geothermal heat forcing. 

The geothermal heat flux acts to create low-density water at the 
ocean bottom, and therefore leads to convective vertical mixing that 
makes the temperature and salinity effectively vertically uniform 
nearly everywhere, creating a water-mass pattern that is completely 
different from that expected during any other period in Earth history 
(Fig. 1a, b). Near the enhanced geothermal heating over the ridge, there 
is a weakly stable stratification due to meltwater production over the 
enhanced heating (bounded by the thick black contour in Fig. 1a, b). 

The MOC (Fig. 1d) is surprisingly strong and is confined to around 
the Equator, very different from the high-latitude present-day MOC. 
The meridional velocity, which together with the vertical velocity 
composes the MOC, therefore decays very rapidly away from the 
Equator. The most prominent features in the zonal velocity field are 
the two strong jet-like flows near the Equator, flowing in opposite 
directions on the two sides of the Equator, and decaying slowly away 
from the Equator (Fig. 1c). The zonal velocity in the higher latitudes, 
away from the equatorial jet-like structures, is generally eastward 
throughout most of the water column apart from near the bottom. 
This eastward mid- to high-latitude velocity is in geostrophic 
equilibrium between the Coriolis force and the pressure gradient, and is 
driven by the large-scale meridional pressure gradient, and hence by the density 
gradient (which is a consequence of heating and melting over the ridge, 
lowering the density there; Methods). 

To understand the equatorial zonal flows and MOC, we note that 
their dominant momentum balance is geostrophic in the north-south 
direction, and hydrostatic in the vertical, as is the case in the present- 
day ocean. The east-west momentum balance, however, is between the 
Coriolis force and eddy (turbulent) viscosity, which is negligible in 
most of the present-day ocean. The dominant momentum balances 
are therefore: 

Figure 1 | Results of a 2D (latitude and depth) ocean model coupled to a 1D 
(latitude only) ice flow model. a, Potential temperature (colour scale; white 
area on top corresponds to the calculated ice cover thickness). b, Salinity 
(colour scale, in parts per thousand, p.p.t.), which varies due to freezing and 
melting (f) induced by spatially variable geothermal heating and heat flux 
through the ice owing to the latitude-dependent atmospheric temperature 
(Methods); salinity is somewhat lower in the Northern Hemisphere because of 
the ridge with enhanced geothermal heating and therefore enhanced melting 


$$-fv = \nu_h u_{yy}$$
$$fu = -p_y/\rho_0$$
$$p_z = -g\rho$$


where the east, north and vertical coordinates and velocities are 
denoted (x, y, z) and (u, v, w), g is the gravitational acceleration, 
$\nu_h = 2 \times 10^{4}$ m² s⁻¹ is the horizontal eddy viscosity coefficient, $f \approx \beta y$ is the 
Coriolis parameter, using the equatorial β-plane approximation, ρ is 
the density, $\rho_0$ a reference density, and p is the pressure. The length 
scale obtained from the zonal momentum equation is $L \sim (\nu_h/\beta)^{1/3} \approx 100$ km, 
consistent with the extent of the MOC in the numerical 
solution. Assuming the meridional pressure gradient set by the differ- 
ential geothermal heating to be approximately constant in latitude near 
the Equator, we can solve the above equations (Methods) to find: 


$$u = \frac{g\rho_y\,(z + H/2)}{\beta\rho_0}\,\frac{1}{y}$$
$$v = -\frac{2\nu_h g\rho_y\,(z + H/2)}{\beta^{2}\rho_0}\,\frac{1}{y^{4}}$$


where H is the ocean depth. The more rapid decay of the meridional 
velocity v as a function of latitude y in these expressions explains why the 
MOC is restricted to the equatorial region, while the equatorial zonal 
flows u extend farther poleward. These solutions also predict that u 
changes sign across the Equator (where y = 0) but v does not, and also 
that both horizontal velocities change sign with depth, at z= —H/2, all 
remarkably consistent with Fig. 1c, d. 
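
As a quick numerical check of the closed-form solution, the following Python sketch evaluates u and v and confirms by finite differences that they satisfy the zonal momentum balance −βyv = ν_h u_yy. The values of β, ρ_0, H and the density gradient ρ_y are illustrative assumptions, not taken from the model runs; only ν_h is the value quoted in the text.

```python
# Illustrative check of the analytical equatorial solution (assumed inputs).
BETA = 2.3e-11   # equatorial beta, 1/(m s); assumed Earth-like value
NU_H = 2.0e4     # horizontal eddy viscosity, m^2/s; value quoted in the text
G = 9.8          # gravitational acceleration, m/s^2
RHO0 = 1.0e3     # reference density, kg/m^3; assumed
H = 2.0e3        # ocean depth, m; assumed for illustration
RHO_Y = 1.0e-8   # meridional density gradient, kg/m^4; hypothetical


def u(y, z):
    """Zonal velocity: decays as 1/y and is odd in y."""
    return G * RHO_Y * (z + H / 2) / (BETA * RHO0) / y


def v(y, z):
    """Meridional velocity: decays as 1/y^4 and is even in y."""
    return -2 * NU_H * G * RHO_Y * (z + H / 2) / (BETA**2 * RHO0) / y**4


def residual_zonal(y, z, dy=1.0e3):
    """Relative mismatch of -beta*y*v = nu_h*u_yy (centred difference for u_yy)."""
    u_yy = (u(y + dy, z) - 2 * u(y, z) + u(y - dy, z)) / dy**2
    lhs = -BETA * y * v(y, z)
    rhs = NU_H * u_yy
    return abs(lhs - rhs) / abs(lhs)


# Length scale from balancing the Coriolis and viscous terms.
length_scale = (NU_H / BETA) ** (1 / 3)

if __name__ == "__main__":
    print(f"L = {length_scale / 1e3:.0f} km")
    print(f"relative residual at y = 200 km: {residual_zonal(2.0e5, -500.0):.1e}")
```

With these assumed inputs the length scale comes out close to 100 km, and the symmetry properties (u odd and v even in y, both vanishing at z = −H/2) follow directly from the functional form.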

The ice thickness in these solutions (Fig. 1e) is quite uniform, varying 
from 1,120 m to 1,200 m, owing to the homogenizing effect of the 


there. c, Zonal velocity (colour scale). d, MOC stream function (colour scale; 
1 Sv = 10⁶ m³ s⁻¹), showing a strong equatorial MOC (35 Sv), compared to the 
present-day high-latitude North Atlantic MOC (about 20 Sv). e, Ice thickness 
and ice velocity as functions of latitude. f, Freezing rate (negative values imply 
melting) at the ice base together with prescribed sublimation/precipitation rates 
at the ice surface, showing that ocean contribution to setting the ice thickness is 
comparable to or larger than that of the atmosphere. 


ice flow’, consistent with previous studies”’’. High-latitude basal freez- 
ing and low-latitude melting rates are balanced by an equatorward ice 
flow and are comparable to, if not larger than, the ice surface sublima- 
tion and precipitation rates (Fig. 1f). These rates are quite different 
from previous estimates that ignored ocean dynamics’, highlighting 
the important role of ocean dynamics. 

The above 2D model predicts the ocean turbulent eddy field to have 
a dominant role in the momentum budget, via its parameterization by 
the horizontal eddy viscosity v,. Snowball Earth eddy motions have 
not been studied before to our knowledge, and to examine them, we 
considered the results of a high-resolution model, shown in Fig. 2. This 
model demonstrated that the solution for the ocean circulation is 
surprisingly turbulent and time-dependent, far from the stagnant 
Snowball Earth ocean one might envision given the lack of wind for- 
cing and air-sea fluxes. The zonal velocity field (Fig. 2c, f) shows 
several strong jets off the Equator, in addition to the two equatorial 
jets seen in the 2D solution. These jets, reminiscent of those seen in the 
atmosphere of Jupiter, arise from the action of the eddies. That is, the 
equatorial flows are unstable and generate eddies; the meridional convergence 
of eddy momentum fluxes, $-\partial \overline{u'v'}/\partial y$, generates additional jets 
off the Equator (Fig. 2f), which again are unstable and feed back on the 
eddies that sustain the jets”*. 
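
The diagnostic behind this argument, the meridional convergence of the zonal-mean eddy momentum flux, can be sketched in a few lines of Python. This is a minimal illustration on a synthetic field, not the analysis code used for Fig. 2f; the function names and the test field are hypothetical.

```python
import math


def zonal_mean(row):
    """Average over longitude samples at one latitude."""
    return sum(row) / len(row)


def eddy_flux_convergence(u, v, dy):
    """Return -d(mean(u'v'))/dy at interior latitudes.

    u, v: lists of rows, one row of longitude samples per latitude.
    Primes denote deviations from the zonal mean; the derivative uses
    centred differences, so the first and last latitudes are dropped.
    """
    flux = []
    for u_row, v_row in zip(u, v):
        u_bar, v_bar = zonal_mean(u_row), zonal_mean(v_row)
        flux.append(zonal_mean([(a - u_bar) * (b - v_bar)
                                for a, b in zip(u_row, v_row)]))
    return [-(flux[j + 1] - flux[j - 1]) / (2 * dy)
            for j in range(1, len(flux) - 1)]


# Synthetic field: eddy amplitude of u grows as y^2, so mean(u'v') = 0.25*y^2
# and the convergence is -0.5*y.
nx, ny, dy = 64, 5, 1.0
wave = [math.cos(2 * math.pi * i / nx) for i in range(nx)]
u_field = [[3.0 + (j * dy) ** 2 * w for w in wave] for j in range(ny)]
v_field = [[0.5 * w for w in wave] for _ in range(ny)]
convergence = eddy_flux_convergence(u_field, v_field, dy)
```

In a model analysis the same operation would be applied to time series of u and v and then averaged in depth and time before comparison with the mean zonal velocity.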

The high-resolution run also shows a warm area to the northwest of 
the land mass, which leads to local melting rates of up to 10 cm yr⁻¹ 
(Fig. 2e), about ten times larger than the maximum melting rates 
calculated by coarse model runs*’. This warmer area is due to a con- 
centrated coastal upwelling of deep water heated by geothermal 
heating. The upwelling occurs in response to the equatorial zonal flow 
away from the continent. This deep heating is allowed by a very weak 
Figure 2 | Results of a 3D high-resolution sector ocean model showing a rich 
time-dependent turbulent eddy field. a, Snapshot of the temperature field 
(colour scale) at 1,150 m (that is, 125 m below the ice). b, As for a, but for 
salinity (colour scale). c, As for a, but for zonal velocity (colour scale). d, Time- 
averaged MOC (colour scale). e, Time-averaged melting rate (colour scale; zero 


local salinity-induced stratification due to geothermally driven ice 
melting (similar to that seen over the ridge in Fig. 1b). Although it is 
intriguing, the prescribed ice thickness in this model does not allow us 
to use this large melting rate conclusively to deduce the existence of 
thin ice or open water locally near continents, as suggested by some 
observations”®, and to explain the survival of photosynthetic life during 
a global Snowball Earth**’>”°. The explicitly resolved eddy field can be 
used to estimate a mixing timescale from low to high latitudes of 
500 years (Methods), which is not very different from present-day 
basin-scale mixing rates. 

Finally, a 3D ocean solution with a realistic continental configu- 
ration supports the results of the 2D model (Fig. 3a, b and Sup- 
plementary Fig. 3). The zonally averaged fields (Supplementary 
Fig. 3), equatorial zonal jets, large-scale westward flows in the higher 
latitudes, the existence of an equatorial MOC, and the vertically uni- 
form temperature, salinity and density in most areas away from the 
enhanced geothermal heating, are all consistent with the 2D solution 
discussed above. Differences from the 2D case include a weaker salinity 
range due to the concentrated geothermal heating region being confined 
and therefore weaker in the 3D case (spatial average is 0.1 W m⁻² 
in both); the equatorial MOC is composed of two cells owing to the 
more complex continental and geothermal heating configuration. Like 
the high-resolution model discussed above, the global 3D solution 
in the presence of continents shows upwelling and downwelling 
wherever the equatorial zonal jets encounter continents. This upwel- 
ling is possibly due to the weak stratification of the Snowball Earth 
ocean, whereas in an ocean with present-day like stratification these 
jets would be diverted sideways via horizontal boundary currents. 

The weakly stratified, convectively mixed ocean found here, as well 
as the strong coastal upwelling, strong equatorial MOC and well 
values marked by thick black contour). f, Zonal, depth and time means of zonal 
velocity (black), and of scaled convergence of eddy momentum fluxes 
$-\partial \overline{u'v'}/\partial y$ (red), both as a function of latitude. The close correspondence 
between the two demonstrates that eddy momentum flux convergence has a 
dominant role in the generation of the zonal jets. 


developed eddy field, all imply that the deep and surface ocean are 
strongly linked in a Snowball Earth scenario, unlike in the well strati- 
fied present-day ocean. The well-mixed Snowball Earth ocean’’ has 
several interesting geochemical implications. First, if there are areas of 
photosynthesis under a local patch of thin ice, mixing or upwelling are 
needed to resupply limiting nutrients, because a weakly mixed ocean 
would lead to the exhaustion of the local nutrient pool and produc- 
tivity would end. Second, the return of banded iron formations after a 
billion-year absence is an iconic feature of Neoproterozoic glaciation 
that is often attributed to stagnant Snowball Earth deep water’. 
Assuming that melt water was derived from land ice that contained 
air bubbles of an oxygenated atmosphere, one might expect that with 
enhanced melting and a strongly mixed ocean, iron concentrations 
could not rise sufficiently to deposit banded iron formations. However, 
Neoproterozoic banded iron formations, unlike Archean and Palaeo- 
proterozoic ones, are not broadly distributed’®. Instead, they form 
irregular lenticular bodies within glacial deposits, and are found pre- 
dominantly in juvenile rift basins in close association with volcanic 
rocks, analogous to the Red Sea, where iron is supplied by hydrothermal 
fluids’’. Such rift basins are somewhat isolated from the rest of the well- 
mixed ocean, allowing the build-up of iron levels and the development 
of banded iron formations, not inconsistent with our results showing a 
well-mixed ocean. 

Our findings suggest that beneath the Snowball Earth ice cover, the 
Neoproterozoic oceans were dynamic, well mixed"! and spatially and 
temporally variable: far from a stagnant pool. These ocean dynamics 
insights open the way to a reinterpretation of some geochemical, sedi- 
mentary and paleontological observations, possibly contributing to the 
debate on the existence of a global ice cover and to our understanding 
of the Neoproterozoic era. 
Figure 3 | Results of the 3D ocean model coupled to a 2D (latitude and 
longitude) ice flow model, in the presence of reconstructed Neoproterozoic 
continental configuration. a, Temperature at 1,200 m depth (colour scale), 
areas of enhanced geothermal heating (black contour lines) and land masses 
(white areas). b, Salinity at 1,200 m (colour scale). c, Ice thickness (colour scale), 
and ice velocity vectors (plotted every fourth grid point in the longitude 


METHODS SUMMARY 


Our model couples the state-of-the-art ocean general circulation model, MITgcm, 
which can simulate thick ice shelves”, with a recently developed 2D (latitude and 
longitude) ice flow model"*, extending previous 1D flow models®’. The model is run 
here in three configurations: 2D (latitude and depth) ocean model coupled to a 1D 
(latitude) ice flow model; a high-resolution (1/8°) sector model with prescribed 
uniform ice cover; and a 3D global model with reconstructed Neoproterozoic 
continental configuration coupled to a 2D ice flow model. A geothermal heat flux 
is prescribed in all three versions at the bottom of the ocean (Supplementary Fig. 2), 
with a spatial average of 0.1 W m⁻², and is spatially uniform except near the 
Equator (motivated by the Neoproterozoic continental reconstruction of ref. 30), 
where the geothermal flux is up to four times its background value. The 2D ocean 
model also prescribes a ridge over which the heating is enhanced, while the topo- 
graphy of the two other models is flat. Sensitivity experiments to the location and 
amplitude of the enhanced heating, and to the topography in both the 2D and 3D 
models, show our solution to be very robust. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Model description. The model used here couples the Massachusetts Institute of 
Technology general circulation ocean model (MITgcm)”* with its ice-shelf pack- 
age” to a recently developed 2D (latitude and longitude) model of thick ice flow 
over a Snowball Earth ocean’*, which is an extension of the 1D ice flow model of 
refs 6 and 7 and similar ones*'~’. The ice flow model compensates for melting and 
sublimation at low latitudes and freezing/precipitation at high latitudes, with an 
equatorward ice flow calculated from ice-thickness gradients based on Glen’s 
law. The ice flow model employs the well-known ice-shelf approximation in which 
the velocity is independent of depth**”’, together with an assumed linear vertical 
temperature profile within the ice®, which together make it possible to average over 
the vertical dimension rather than model it explicitly. 

The coupling of the ice and ocean models is done asynchronously: the ocean 
model is run for 300 years and the ice model is then run for a similar period, and 
the process is repeated until a steady state is obtained. At each iteration the ocean 
model is given the ice thickness calculated by the ice model, while the ice model is 
driven by the melting and freezing rates calculated by the ocean model. The ice- 
surface meridional temperature, sublimation and snowfall are taken from ref. 7, 
with an Equator-to-pole temperature difference of 36 °C (Supplementary Fig. 1). 
No-slip ocean boundary conditions are specified on side and bottom boundaries, 
and free-slip conditions are specified under the ice cover. 
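
The asynchronous coupling described above can be sketched as a simple alternating loop. The two model functions below are hypothetical toy stand-ins (a single ice thickness and a made-up freezing rule), used only to show the control flow; they are not the MITgcm or ice flow model interfaces, and the toy omits the ice flow itself.

```python
def run_ocean(ice_thickness_m, years):
    """Hypothetical stand-in for an ocean segment run with fixed ice thickness.

    Returns a basal freezing rate in m/yr (negative means melting).
    Toy rule: ice thinner than 1,000 m freezes, thicker ice melts.
    The `years` argument is unused in this toy rule.
    """
    return (1000.0 - ice_thickness_m) / 1000.0


def run_ice(ice_thickness_m, freezing_rate_m_per_yr, years):
    """Hypothetical stand-in for an ice segment run with a fixed freezing rate."""
    return ice_thickness_m + freezing_rate_m_per_yr * years


def couple_asynchronously(h0_m, segment_years=300, tol_m=1e-3, max_iter=1000):
    """Alternate ocean and ice segments until the ice thickness stops changing."""
    h = h0_m
    for iteration in range(1, max_iter + 1):
        rate = run_ocean(h, segment_years)       # ocean sees the latest ice thickness
        h_new = run_ice(h, rate, segment_years)  # ice sees the latest freezing rate
        if abs(h_new - h) < tol_m:
            return h_new, iteration
        h = h_new
    raise RuntimeError("no steady state within max_iter iterations")


thickness, n_iter = couple_asynchronously(500.0)
```

The design point is that each component only ever sees the other's most recent output, so the two models can use very different time steps and run lengths.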

Turbulent convection is parameterized in our model experiments by an 
increased vertical diffusion where the vertical stratification is unstable. Eddy mix- 
ing and mixed-layer parameterizations developed for the present-day ocean*® are 
probably not applicable for the very weakly stratified Snowball Earth ocean, and 
we use instead crude horizontal and vertical eddy diffusion. A different choice was 
made by ref. 11, who used an eddy parameterization tuned to present-day ocean 
eddies* and simulated a Snowball Earth ocean under 200-m-thick ice with no 
geothermal heating and therefore not at a thermodynamic equilibrium, but with a 
quasi-equilibrated circulation. They found a very weakly stratified ocean and 
described a vanishing Eulerian velocity field, yet with strong high-latitude para- 
meterized eddy-driven meridional circulation cells, very different from the vig- 
orous Eulerian equatorial overturning and jets we found here. 

The 2D ocean model uses a horizontal diffusion coefficient of $\kappa_h = 200$ m² s⁻¹ 
and viscosity of $\nu_h = 2 \times 10^{4}$ m² s⁻¹; the 3D “realistic geometry” case uses different 
values of $\kappa_h = 500$ m² s⁻¹ and ν_h = 5 × 10' m² s⁻¹ required by its different 
resolution. The vertical viscosity in both models is set to vy = 2X 10. *m’s ‘and 
the vertical diffusivity to x, = 1X 10-*m’s'. The high-resolution case uses a 
Leith eddy viscosity formulation”. 

The 2D ocean model (latitude and depth, periodic in longitude) has a horizontal 
latitudinal resolution of 1° and 32 vertical levels, varying in thickness from 10m 
adjacent to the ice to 200 m at depth (vertical grid spacing varies from 920m at 
the top, entirely within the ice, followed by 15 levels of 10 m thickness each, and 
then 12 m, 17 m, 23 m, 32 m, 45 m, 61 m, 82m, 110m, 148 m, and seven levels of 
thickness 200 m at the bottom). The 2D ocean model is coupled to a 1D (latitude) 
version of the ice flow model. We performed sensitivity runs to the location and 
amplitude of the geothermal forcing and found the solution to be very robust. 

The 3D near-global ocean model was run at a horizontal resolution of 2° from 
82° S to 82° N, and with 73 vertical levels of varying thickness from 10 m near the 
ice to 200 m at depth (starting from the ocean surface, the first level is 550 m thick, 
followed by 57 levels whose thickness is 10m, and then 14m, 20 m, 27 m, 38 m, 
54m, 75m, 105m, 147 m, and finally seven levels of thickness 200 m; at steady 
state, the upper 33 levels are within the ice, and therefore inactivated, and the rest 
represent the ocean). Continental configuration is based on the 720-million-year- 
old Neoproterozoic reconstruction of ref. 30, and includes an estimated location of 
spreading ridges between the continents, where geothermal heating is probably 
enhanced (thick black contour in Fig. 3a; spreading ridges elsewhere are not 
included owing to the uncertainty in their location). Bathymetry reconstructions 
for the Neoproterozoic are not easy to come by, and we therefore specified a flat 
topography in the 3D case, relying on land masses to restrict zonal flows. We 
performed sensitivity runs with specified sills (1 km high) around the ocean con- 
stricted by the land masses, as well as with (1 km high) mid-ocean ridges specified 
in the open ocean, and found that these do not change the overall picture. 
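
The two vertical grids described above can be written out explicitly as a consistency check on the quoted level counts. The helper name and its arguments are hypothetical; the thicknesses are the ones listed in the text.

```python
def level_thicknesses(top, n_thin, stretch, n_deep, thin=10, deep=200):
    """Build a list of level thicknesses (m), from the top level downward.

    top:     thickness of the first (thick) level
    n_thin:  number of `thin`-metre levels below it
    stretch: explicit list of gradually thickening levels
    n_deep:  number of `deep`-metre levels at the bottom
    """
    return [top] + [thin] * n_thin + list(stretch) + [deep] * n_deep


# 2D model: 920 m, then 15 x 10 m, nine stretched levels, then 7 x 200 m.
grid_2d = level_thicknesses(920, 15, [12, 17, 23, 32, 45, 61, 82, 110, 148], 7)

# 3D model: 550 m, then 57 x 10 m, eight stretched levels, then 7 x 200 m.
grid_3d = level_thicknesses(550, 57, [14, 20, 27, 38, 54, 75, 105, 147], 7)

if __name__ == "__main__":
    for name, grid in (("2D", grid_2d), ("3D", grid_3d)):
        print(name, len(grid), "levels,", sum(grid), "m total")
```

The lists reproduce the 32 and 73 levels stated in the text, and both columns (ice plus ocean) add up to 3,000 m.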

The high-resolution sector ocean model spans 45° longitude, from 10.5° S to 
10.5° N, at a horizontal resolution of an eighth of a degree longitude and latitude 
and 20 vertical levels of thickness 100 m each. An ice cover with uniform thickness 
of 1,025 m is prescribed in this case rather than using the ice flow model (which 
would be prohibitive owing to the computational cost). A flat ocean topography is 
used, and enhanced geothermal heating is prescribed as a Gaussian centred around 
6°N, at up to four times the amplitude of the background geothermal flux. The 
surface temperature in this experiment is uniform (due to its relatively small 
meridional extent) and set to −44.4 °C. This high-resolution run was integrated 
for over 100 years to reach an equilibrium of the eddy field, yet because its ice cover 
is specified rather than evolving, the run is, unlike the other two models, not 
at a complete thermodynamic equilibrium with its ice cover. 

While geothermal heat can have some effect on the present-day ocean”, it is the 
dominant forcing in a Snowball Earth ocean. A hydrothermal heat flux is therefore 
prescribed here at the bottom of the ocean and is spatially uniform except where 
enhanced flux is prescribed to simulate the effect of spreading ridges. These 
enhanced geothermal areas are prescribed in the model runs shown in the paper 
to be not far from the Equator (motivated by the Neoproterozoic continental 
reconstruction of ref. 30), and the geothermal flux there is up to four times its 
background value. The observed background flux away from ocean ridges is about 
0.05 W m⁻², and the flux observed at mid-ocean ridges is about four times larger, 
such that the average over ocean basins is estimated to be of the order of 0.1 W m⁻² 
(see, for example, table 4 of ref. 39). We therefore prescribe a flux whose spatial 
average is 0.1 W m⁻², and which is larger locally over ridges. At a steady state the 
average flux escaping the ocean through the thick ice cover is therefore also 
0.1 W m⁻². Supplementary Fig. 2 shows the hydrothermal forcing of all three 
models shown in the paper. The Neoproterozoic flux may have been slightly larger 
than that of the present day, but the difference is expected to be small relative to 
other uncertainties, so that modern values are used here. 

Most ocean models cannot incorporate an ice cover thicker than their upper 
level, which is of the order of 5-50 m. It is, however, important to use a thick shelf 
ice formulation as done here, given that representing thick ice-shelves as thin sea 
ice is known to lead to biases in modelling present-day deep-water formation 
around Antarctica owing to the induced biases in freshwater fluxes due to melting/ 
freezing’. An alternative to our thick ice-shelf formulation was used by ref. 11, 
which employed a dynamic rescaling of the vertical coordinate’. 
Derivation of the solution for the 2D flow field. Consider the zonal equatorial 
jets seen (for example) in Fig. 1. Differentiate the hydrostatic equation in y and 
integrate in z to find $p_y = -g\rho_y z + F(y)$, and assume this meridional pressure gradient, 
set by the differential geothermal heating, to be approximately constant in 
latitude y around the Equator. The meridional momentum equation then leads to 
an expression for u, and the zonal momentum equation to an expression for v. 
Requiring the vertically integrated meridional flow to vanish owing to mass con- 
servation, we find F(y), giving: 


$$u \approx \frac{g\rho_y\,(z + H/2)}{\beta\rho_0}\,\frac{1}{y}$$
$$v \approx -\frac{2\nu_h g\rho_y\,(z + H/2)}{\beta^{2}\rho_0}\,\frac{1}{y^{4}}$$
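
For readers who want the intermediate steps, the derivation can be written out as follows. This is a sketch under the stated assumptions: ρ_y is taken to be independent of depth and of latitude near the Equator, F is treated as constant in y, f ≈ βy, and the ocean is assumed to occupy −H ≤ z ≤ 0. The sign of the first term in p_y follows from the hydrostatic balance p_z = −gρ.

```latex
\begin{align}
p_z &= -g\rho
  \;\Rightarrow\; p_{yz} = -g\rho_y
  \;\Rightarrow\; p_y = -g\rho_y z + F, \\
\beta y\,u &= -\frac{p_y}{\rho_0}
  \;\Rightarrow\; u = \frac{g\rho_y z - F}{\beta\rho_0}\,\frac{1}{y}, \\
-\beta y\,v &= \nu_h u_{yy}
  = \nu_h\,\frac{2\,(g\rho_y z - F)}{\beta\rho_0\,y^{3}}
  \;\Rightarrow\; v = -\frac{2\nu_h\,(g\rho_y z - F)}{\beta^{2}\rho_0}\,\frac{1}{y^{4}}, \\
0 &= \int_{-H}^{0} v\,\mathrm{d}z
  \;\Rightarrow\; \int_{-H}^{0} (g\rho_y z - F)\,\mathrm{d}z
  = -\tfrac{1}{2}\,g\rho_y H^{2} - F H = 0
  \;\Rightarrow\; F = -\tfrac{1}{2}\,g\rho_y H .
\end{align}
```

Substituting F gives $g\rho_y z - F = g\rho_y (z + H/2)$, which yields the two expressions for u and v.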


If the geothermal heating and bathymetry are prescribed to be uniform, the 
MOC is weaker by about 75%, and if the ice-surface meridional temperature is also 
uniform, the MOC vanishes. This suggests that the atmospheric forcing via the ice- 
surface meridional temperature is responsible for about 25% of the MOC, in spite 
of the thick ice cover. The atmospheric driving of the Snowball Earth ocean, and of 
the MOC in particular, is through its effect on melting rates at high versus low 
latitudes. This difference mostly depends on the Equator-to-pole atmospheric 
temperature difference, and if the carbon dioxide is increased, for example, and 
if the meridional temperature difference is consequently lowered, then the atmo- 
spheric driving of the ocean circulation may be weaker. In addition to this, the 
sensitivity of the ice flow to surface temperatures derived from different carbon 
dioxide concentrations was explicitly discussed by ref. 15. When the geothermal 
forcing is uniform (but the ice-surface meridional temperature is still a function of 
latitude y), or the ridge with enhanced heating is placed at the Equator, the north- 
south symmetry is still broken, and although the MOC is weaker and could be in 
either direction depending on initial conditions, the flows are qualitatively similar 
to the above. 
On the eddy field in the high-resolution model run. The relatively strong zonal 
velocities that develop in the above 2D model near the Equator are characterized 
by a Reynolds number (based on molecular viscosity) that is much greater than 
one, of R= uL/v = (0.03ms_ ')(2 X 10° m)/(1.8 X 10 °m’s ') +4 X 10”, anti- 
cipating a turbulent flow. In addition, the meridional gradient of vorticity (6 — 
Uyy) changes sign as function of latitude, indicating that eddies may develop owing 
to barotropic instability. It is also possible that the eddy field may be able to extract 
available potential energy from the (weak) meridional density gradient. 

The eddy motions seen in our high-resolution run are characterized by a velocity 
scale of $u' \sim 0.02$ m s⁻¹ and a length scale of $l \sim 100$ km (Fig. 2c). These scales 
can be used to estimate a mixing-length eddy viscosity of $\nu_h \sim u'l \sim 2 \times 10^{3}$ m² s⁻¹. 
A mixing timescale from low to high latitudes is therefore given by $t \sim R^{2}/\nu_h \sim$ 
500 years, where R is the Earth’s radius, not very different from present-day 
basin-scale mixing rates. This is a lower bound on mixing timescales, because one expects 
the snowball eddy field to be weaker at the higher latitudes where the mean flows 
and resulting eddy field are weaker. This is in contrast to the present-day ocean, 
where eddy generation is strong at mid- to high latitudes near western boundary 
currents. 
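
The mixing-length estimate above is simple enough to reproduce directly. The sketch below uses the eddy velocity and length scales quoted in the text; the Earth radius and the year length are standard values supplied here, and the function names are hypothetical.

```python
EARTH_RADIUS_M = 6.371e6             # mean Earth radius, m (standard value)
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def mixing_length_viscosity(u_eddy, l_eddy):
    """Mixing-length eddy viscosity nu ~ u' * l, in m^2/s."""
    return u_eddy * l_eddy


def mixing_time_years(radius_m, viscosity):
    """Diffusive timescale t ~ R^2 / nu, converted to years."""
    return radius_m**2 / viscosity / SECONDS_PER_YEAR


nu_eddy = mixing_length_viscosity(0.02, 1.0e5)   # u' ~ 0.02 m/s, l ~ 100 km
t_mix = mixing_time_years(EARTH_RADIUS_M, nu_eddy)

if __name__ == "__main__":
    print(f"nu ~ {nu_eddy:.0f} m^2/s, t ~ {t_mix:.0f} years")
```

With these inputs the timescale comes out at roughly 640 years, the same order of magnitude as the approximately 500 years quoted; scale estimates of this kind are only meaningful to within a factor of order one.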

Salinity uncertainties. Estimating ocean salinity during the Neoproterozoic pre- 
sents a challenge. We assume the pre-Snowball Earth salinity to be equal to that of 
the present day, but there is considerable evidence that it varied significantly and may 
have been as high as twice that of the present day”; see also some recent reviews of 
evaporites through time****. Salinity may also have increased during Snowball 
Earth events if oceans continued to receive a supply of salts through both subgla- 
cial runoff and subglacial sediment transport, whereas evaporite deposition would 
have been nonexistent. The main effect of a higher ocean salinity on our simulation 
would have been the additional lowering of the freezing temperature, which is 
already quite low relative to modern values owing to both high pressure at the ice 
base and high salinity. 
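
The dependence of the freezing temperature on salinity and pressure can be illustrated with the widely used UNESCO (Fofonoff and Millard, 1983) empirical formula for seawater. Using this formula is an assumption of the sketch and not necessarily what the model's ice-shelf package does; it was fitted over a limited range of salinity and pressure, so values at the pressure beneath roughly 1 km of ice are an extrapolation. The ice density and thickness below are likewise assumed for illustration.

```python
def freezing_point_c(salinity, pressure_dbar):
    """Freezing temperature of seawater (deg C), UNESCO 1983 empirical fit.

    salinity: practical salinity; pressure_dbar: pressure in decibars.
    """
    return (-0.0575 * salinity
            + 1.710523e-3 * salinity ** 1.5
            - 2.154996e-4 * salinity ** 2
            - 7.53e-4 * pressure_dbar)


def ice_base_pressure_dbar(thickness_m, ice_density=917.0, g=9.8):
    """Hydrostatic pressure under an ice layer; 1 dbar = 1e4 Pa.

    ice_density (kg/m^3) is an assumed typical value.
    """
    return ice_density * g * thickness_m / 1.0e4


p_base = ice_base_pressure_dbar(1150.0)   # assumed ice thickness, m
t_surface = freezing_point_c(35.0, 0.0)   # present-day-like salinity at the surface
t_base = freezing_point_c(35.0, p_base)   # same salinity under thick ice

if __name__ == "__main__":
    print(f"surface: {t_surface:.2f} C, ice base: {t_base:.2f} C")
```

Both higher pressure and higher salinity push the freezing temperature down, which is the sense of the effect described in the text.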

Another source of uncertainty in salinity is our assumed concentration of salt as 
a result of land-ice volume being equivalent to about 1 km of sea level. Modelling 
studies**”” estimate continental ice sheets to be a few kilometres thick after about 
100,000 years, roughly comparable to the East Antarctic Ice-Sheet with a mean 
thickness of 2.15 km (ref. 48). In addition, geological relationships” estimated a 
post-glacial sea-level rise of 1.5 km, which is equivalent to melting 3.3-km-thick 
grounded ice sheets on all continents and oceanic platforms. We conclude that 
although no tight constraint on land ice exists, 1-km-thick ice sheets seem to be 
within the range of current estimates. 

Yet another important salinity-related issue is the possibility of freshwater input 
from land-based subglacial flows. Such freshwater flow at the ocean surface, which 
may develop should there be any open coastal water, could be mixed down by the 
strong vertical convective mixing. Yet, if the fresh water input is strong enough, it 
could lead to a weak local coastal stratification (as seen in our solution) over the 
ridge due to melting there (Fig. 1b). This scenario remains speculative, given that it 
hinges on the existence of open coastal water, which our model does not explicitly 
predict. 
Amphibious flies and paedomorphism in the Jurassic period 


Diying Huang, André Nel, Chenyang Cai, Qibin Lin & Michael S. Engel 


The species of the Strashilidae (strashilids) have been the most per- 
plexing of fossil insects from the Jurassic period of Russia and 
China’’. They have been widely considered to be ectoparasites of 
pterosaurs or feathered dinosaurs, based on the putative presence of 
piercing and sucking mouthparts and hind tibio-basitarsal pincers 
purportedly used to fix onto the host’s hairs or feathers’ °. Both the 
supposed host and parasite occur in the Daohugou beds from the 
Middle Jurassic epoch of China (approximately 165 million years 
ago)”*. Here we analyse the morphology of strashilids from the 
Daohugou beds, and reach markedly different conclusions; namely 
that strashilids are highly specialized flies (Diptera) bearing large 
membranous wings, with substantial sexual dimorphism of the 
hind legs and abdominal extensions. The idea that they belong to 
an extinct order? is unsupported, and the lineage can be placed 
within the true flies. In terms of major morphological and inferred 
behavioural features, strashilids resemble the recent (extant) and 
relict members of the aquatic fly family Nymphomyiidae. Their 
ontogeny is distinguished by the persistence in adult males of 
larval abdominal respiratory gills, representing a unique case of 
paedomorphism among endopterygote insects. Adult strashilids 
were probably aquatic or amphibious, shedding their wings after 
emergence and mating in the water. 


Order Diptera 
Strashila daohugouensis sp. nov. 


Etymology. The name refers to the locality of the type specimen. 
Holotype. Nanjing Institute of Geology and Palaeontology (NIGP) 155020, 
a complete male with details of head, associated with female allotype, 
without counterpart. 

Referred material. Allotype NIGP156171; paratypes NIGP155017, 
NIGP155018, NIGP155019, NIGP155022, NIGP155023, NIGP155024, 
NIGP155025, NIGP155027 and NIGP156170. 

Locality. The Xiayingzi outcrop of the Daohugou beds, Daohugou 
Village, Ningcheng County, Inner Mongolia, China. The exception 
is NIGP155019 (see Supplementary Information). 

Horizon. Jiulongshan Formation, Middle Jurassic epoch. 

Diagnosis. The new species greatly resembles the type species, 
Strashila incredibilis, except that it has shorter and more swollen male 
metafemora (see Supplementary Information for further details of the 
new species). 

The enigmatic Jurassic-period insect Strashila incredibilis displays a 
hypognathous head with compact antenna and a short ‘suctorial beak’, 
an apterous thorax, prominent chelate hind legs, and fleshy lateral 
abdominal appendages’. Only a few strashilid males have been 
reported from the Late Jurassic epoch of Russia’ and Middle Jurassic 
epoch of China’’, and therefore knowledge of these enigmatic insects 
has been very limited. Recent descriptions add little critical informa- 
tion with regard to their functional morphology and systematic affi- 
nities, and attribute strashilids to an extinct, putatively ectoparasitic 
order’. The present study is based on 13 new specimens of 2 different 


forms, Strashila daohugouensis sp. nov. (9 males, 2 females) and Vosila 
sinensis (2 males) (we demonstrate that Vosila sinensis and Parazila 
saurica are the same species; for details see Supplementary Informa- 
tion) from Daohugou. These specimens are rare among the 100,000 
fossil insects recovered from Daohugou and among the collections of 
the Nanjing Institute of Geology and Palaeontology. 

Although definitive Middle Jurassic ectoparasites (fleas) on verte- 
brates were reported recently from the Daohugou outcrops*”, earlier 
speculation regarding strashilids as terrestrial ectoparasites can be 
rejected owing to an alternative hypothesis stemming from the discovery 
of hitherto unknown large membranous wings (Fig. 1i) and 
females of Strashila, in two cases preserved in copula (Fig. 1a, e). Males 
and females have identical head morphologies but differ strongly in the 
absence of the hind leg pincers in females, which excludes ectoparasit- 
ism on terrestrial vertebrates as these are reinterpreted as sex-related 
structures for grasping the female rather than a host’s integument. In 
addition, unlike ectoparasitic insect lineages, the body is cylindrical in 
strashilids (Fig. 1e), rather than dorsoventrally or laterally flattened. 

Male strashilids have genitalia of antliophoran type’** that are 
thought to have terminalia characteristic of Holometabola and are 
close to those of Mecopteroidea, but with primitive volsellae (small 
accessory clasping sclerites)”. This has led to the suggestion that they 
are an extinct order and a sister group to the Mecoptera’. In fact, the 
male genitalia are virtually identical with those of the recent nemato- 
ceran flies'' (for example, Chironomidae, Sciaridae or some Nym- 
phomyiidae) rather than Mecoptera. Examination of the genitalia in 
finer detail reveals that they are clearly dipteran. The male tergite 
VIII shows a rounded anterolateral process (Figs 1c, f and 2c, and 
Supplementary Fig. 3i), and it has a volsella of typical dipteran mor- 
phology, gonocoxites articulated on the posterolateral sides of the 
transverse tergite, and gonostyli with terminal grasping hooks. The 
presence of these features in combination reinforces the placement of 
strashilids among basal lineages of the true flies'*’* (Figs 1c, f and 2c, 
and Supplementary Figs 3i and 6a). 

The relatively small, oval, prognathous head bears a pair of large 
compound eyes with numerous facets and a pair of lateral ocelli, which 
are not completely or virtually lost or reduced as in ectoparasitic 
insects (Fig. 1b, h and Supplementary Figs 1a–d, 2b, c, e, i and 3b). 
Together with the presence of short antennae, this supports further the 
affinity of the Strashilidae with true flies. The mouthparts are vestigial, 
remarkably reduced in both sexes (Fig. 1b, h and Supplementary Figs 
2e and 3b), suggesting that the adults do not feed. The antennae have a 
scape and a pedicel, both of which are large, and a long flagellum with 
the main section annulated, comparable in critical details with those of 
nymphomyiids"* (Fig. 1h and Supplementary Figs 2b, c, e, i and 3b). 

The thorax of strashilids shows the typical subdivision of the thorax 
of true flies (Fig. 1a, g and Supplementary Fig. 2h). The male wings, 
large and membranous, are fringed with numerous long setae (Figs 1i, j 
and 3a, and Supplementary Figs 5a, e, f and 7b) and are similar to those 
of nymphomyiids, a primitive lineage of aquatic flies. The wings bear a 
Figure 1 | Strashila daohugouensis sp. nov. from the Middle Jurassic epoch 
of Daohugou, China. a, Mating pair, holotype (NIGP155020, male) and 
allotype (NIGP156171, female). b, Enlargement of the head of the male in 

a, white arrows indicate ocelli. c, Enlargement of the male genitalia in 

a. d, Enlargement of the female terminalia in a. e, Another mating pair, male 


few simple veins (Supplementary Fig. 5b, c) and closely resemble those 
of recent Nymphomyia walkeri in this respect'*. The broadly separated 
coxae, a feature that is considered to be consistent with the hypothesis 
of ectoparasitism, is also found in species with an aquatic habit and 


Figure 2 | Vosila sinensis from the Middle Jurassic epoch of Daohugou, 
China. a, Complete male (NIGP155026). b, Nearly complete male 
(NIGP155021a). c, Enlargement of the male genitalia in b. d, Enlargement of 


(NIGP155018a) and female (NIGP156170b). f, Enlargement of the male 
genitalia in e. g, Large male specimen (NIGP155017). h, Enlargement of the 
female head in e. An, antenna; E, eye. i, A male (NIGP155024) with wing. 

j, Enlargement of the marginal setae in i. Scale bars, 2 mm (a, e, g, i), 

500 pum (b-d), 200 pm (f) and 1 mm (j). 


therefore cannot alone support attribution of the fossils with the former 
life-history scenario. The basitarsi of the fore and mid legs were armed 
with a row of small denticles (Supplementary Figs 1f, 2b, g, i and 4d), 
and were probably used for grasping. The strongly curved and sharply 


at ou ». 


the dorsoventrally compressed gills in b. e, Enlargement of the laterally 
compressed gills in b. Scale bars, 2 mm (a, b), 1 mm (d, e) and 500 jim (c). 
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Figure 3 | Strashila dachugouensis sp. nov. from the Middle Jurassic epoch of Daohugou, China. a, Reconstruction of the male. b, Reconstruction of the wing- 


shed female. 


pointed pretarsal claws (Supplementary Figs 1g, 2b and 4d, e) corres- 
pond to adaptations for anchoring to fixed objects in the water 
column”. The male’s swollen metafemur and metatibia, and its strong 
pincers (Figs la, e, g and 3a, and Supplementary Figs 2a, d, f, j, 3a, f, g 
and 5a), are sexually dimorphic as females have otherwise normal 
dipteran hind legs (Figs 1a, e and 3b and Supplementary Fig. 3a, f), 
and male pincers were used in either male-male competition or for 
holding the female during mating (Figs 1a, e and 4, and Supplementary 
Figs 3a, f, g and 7a), similar to some modern male oedemerid beetles’”. 
The swollen legs also parallel the swollen tarsi of modern male dance 
flies (Empididae), and would have been used as visual signals to attract 
females’®. 

The most puzzling features of the strashilids are the pairs of lobes 
on the sides of the abdomen in males, which are inconsistent with 
terrestrial locomotion or ectoparasitism. Previous authors considered 
the function of these structures to be obscure” or thought that they 
were elaborate versions of the eversible abdominal glands of higher 
mecopterans*. These structures probably do not correspond to prolegs 
as in some dipteran larvae (for example, Nymphomyiidae and Empi- 
didae) because of their lateral insertion and absence of apical crochets. 
Vosila and Strashila display homologous abdominal extensions. Vosila 
have two series of lateral filamentous projections (Fig. 2a, b, d, e and 
Supplementary Fig. 4a, c), similar to those of Hydrochara affinis"”, 
which correspond to gills typically used for aquatic respiration, such 
as in mayflies, many beetles, dobsonflies, caddisflies and some cater- 
pillars*’**. In contrast, the tube-like extensions of Strashila (Figs 1g and 
3a, and Supplementary Figs 2a, d, f, h, j, 3d, 5d and 6a-c) are compar- 
able to those on segments V and VI of some male nymphomyiids, 
possibly also for grasping females”. Some Daohugou winged 
adult Ephemeroptera also have paired, slender, filamentous, lateral 
abdominal gills (Supplementary Fig. 6a-g). Although similar gills 
occur in a few modern Ephemeroptera and Plecoptera, there is no 
record of adult Holometabola having well-developed and functional 
filamentous gills***°. The presence of such gills indicates a continua- 
tion of larval structures into the adult stadium, representing a unique 
form of paedomorphism associated with this major biological func- 
tion. To our knowledge, the very well preserved male NIGP155017 is 
the first strashilid fossil to show an abdominal tracheal system con- 
nected to spiracles (Fig. 1g and Supplementary Fig. 6a, b) that is similar 
to those of the aquatic larvae of the beetles Hydrophilus, Hydrochara 
and Berosus’’, and the dobsonfly Neuronemus, in which the spiracles 
serve a respiratory function”®. 

These structures provide evidence that strashilids share several 
potential apomorphies with modern nymphomyiid flies, including 
the finer details of the antennae, ocelli, wings, terminalia and legs 
(see Supplementary Information for details). Their reduced mouth- 
parts and wings are inefficient for feeding and active flight, respectively, 
and indicate an ephemeral life history, akin to modern nymphomyiid 
taxa. Similar to modern nymphomyiids, adult Strashilidae probably 
mated soon after emergence, shed their wings, crawled beneath the water, 
mated, and in some cases died in copula’*'***”’, with the superposing 
male grasping the female below, thereby explaining the discoveries of 
females grasped by males, and both without their wings (Fig. 4). 

The discovery of fossilized females and of several interesting char- 
acteristics of the wings resolves one of the long-standing questions of 
insect palaeontology and evolution—namely the biology and func- 
tional morphology of the enigmatic Strashilidae. We conclude that 
the Strashilidae were a highly specialized dipteran clade related to 
Nymphomyiidae, with an aquatic or amphibious life history. These 
new discoveries also aid our understanding of modern nymphomyiid 
morphology, suggesting that the abdominal extensions in males of 
some species may be a paedomorphic remnant, and emphasizing a 
unique developmental syndrome for future evolutionary developmental 
investigation. The Strashilidae were not parasitic. Ectoparasites of early 
mammals or feathered dinosaurs, principally fleas, were indeed present 
in the Daohugou fauna”, but the strashilids were not one of them. 

Figure 4 | Ecological reconstruction of Strashila daohugouensis sp. nov. 
from the Middle Jurassic epoch of Daohugou, China. Two males are shown 
flying. A female with wings is shown crawling on land (top right), and two 
males and a female (top left) are shown searching for mates in water, having 
shed their wings (isolated wings are also shown floating on the water). The 
illustration of the mating pair (front) shows how the male may have grasped the 
female. 

96 | NATURE | VOL 495 | 7 MARCH 2013 


METHODS SUMMARY 


The specimens are housed at the Nanjing Institute of Geology and Palaeontology, 
and were prepared using a sharp knife. Photographs were taken using a Zeiss 
Discovery V20 microscope system, with specimens moistened with 70% alcohol 
(Figs 1b-d, h and 2a, c-e, and Supplementary Figs 2b, c, e, g, i, j, 3a-f, h, i and 
4b-e, g), and line drawings were made using a camera lucida on a binocular 
Olympus SZX7. Reconstructions were prepared with Corel Painter and Adobe 
Photoshop software. Finally, some specimens were examined with a LEO1530VP 
field emission scanning electron microscope. 
NFIB is a governor of epithelial-melanocyte stem cell 
behaviour in a shared niche 
Adult stem cells reside in specialized niches where they receive 
environmental cues to maintain tissue homeostasis. In mammals, the 
stem cell niche within hair follicles is home to epithelial hair follicle 
stem cells and melanocyte stem cells, which sustain cyclical bouts of 
hair regeneration and pigmentation’*. To generate pigmented 
hairs, synchrony is achieved such that upon initiation of a new 
hair cycle, stem cells of each type activate lineage commitment”. 
Dissecting the inter-stem-cell crosstalk governing this intricate coor- 
dination has been difficult, because mutations affecting one lineage 
often affect the other. Here we identify transcription factor NFIB as 
an unanticipated coordinator of stem cell behaviour. Hair follicle 
stem-cell-specific conditional targeting of Nfib in mice uncouples 
stem cell synchrony. Remarkably, this happens not by perturbing 
hair cycle and follicle architecture, but rather by promoting mela- 
nocyte stem cell proliferation and differentiation. The early produc- 
tion of melanin is restricted to melanocyte stem cells at the niche 
base. Melanocyte stem cells more distant from the dermal papilla are 
unscathed, thereby preventing hair greying typical of melanocyte 
stem cell differentiation mutants. Furthermore, we pinpoint KIT- 
ligand as a dermal papilla signal promoting melanocyte stem cell 
differentiation. Additionally, through chromatin-immunoprecipi- 
tation with high-throughput-sequencing and transcriptional profi- 
ling, we identify endothelin 2 (Edn2) as an NFIB target aberrantly 
activated in NFIB-deficient hair follicle stem cells. Ectopically 
induced Edn2 recapitulates NFIB-deficient phenotypes in wild- 
type mice. Conversely, endothelin receptor antagonists and/or KIT 
blocking antibodies prevent precocious melanocyte stem cell differ- 
entiation in the NFIB-deficient niche. Our findings reveal how mela- 
nocyte and hair follicle stem cell behaviours maintain reliance upon 
cooperative factors within the niche, and how this can be uncoupled 
in injury, stress and disease states. 

Hair follicle stem cells and melanocyte stem cells remain quiescent 
within their hair follicle niche for weeks, a period known as telogen 
phase. With each new hair cycle, these two stem cell populations are 
coordinately activated. This happens when inhibitory signals are 
counteracted by activating cues that accumulate from Wnt and BMP/TGF-β 
(bone morphogenetic protein/transforming growth factor β) crosstalk 
with dermal papilla at the niche base**. Synchronized activity 
continues throughout the hair cycle. During the growth phase (ana- 
gen), melanocytes at the base of the mature hair follicle (‘hair bulb’) 
produce and transfer pigment to neighbouring committed hair follicle 
stem cell progeny (‘matrix’) as they differentiate into hair cells*°. When 
destruction (catagen) ensues, melanocytes and matrix cells in the hair 
bulb apoptose, and the dermal papilla (enveloped by the hair bulb 
during anagen) retracts upward, returning the follicle to telogen. As 
anagen begins and a new hair bulb emerges, both hair follicle stem cells 
and melanocyte stem cells contain nuclear β-catenin, implicating 
canonical Wnt signalling in stem cell coordination®*. These and several 
other insights”””° suggest how local environmental signals synchronize 
proliferation and lineage progression of stem cells during hair cycling. 

Uncoupling melanocyte and epithelial stem cell behaviours occurs 
under transient conditions, that is, in response to ultraviolet radiation, 
and in various disease and injury states'’'*. Given the impact of Wnt 
and other signals on stem cells and their lineages, and current dogma 
that matrix cells must differentiate for melanocyte pigment to transfer’, 
the mechanisms by which melanocyte stem cells can be selectively 
mobilized from their niche without otherwise disrupting the normal 
hair cycle remain unknown. 

Our venture into this study was prompted by our finding that rela- 
tive to progeny, hair follicle stem cells express elevated nuclear factor 
I/B (NFIB)'. NFIB is required for lung and brain development and is 
often amplified and/or found at oncogenic chromosomal breakpoints 
in epithelial cancers'**. NFIB was first detected in epidermis at 
embryonic day 14.5 (E14.5), concomitant with upregulation of estab- 
lished skin progenitors. Expression intensified as hair follicle stem cells 
emerged (Fig. 1a and Supplementary Fig. 1a-c). 

In adult hair follicles, NFIB co-localized with hair follicle stem cells, 
whose niche in telogen is subdivided into an upper ‘bulge’ compart- 
ment and lower ‘hair germ’ (or secondary hair germ) adjacent to 
dermal papilla (Supplementary Fig. 1d). During anagen, NFIB- 
positive cells were also found within the upper outer root sheath 
(ORS), which in early catagen will form the new niche (bulge and 
hair germ) for the next hair cycle’® (Fig. 1b). NFIB waned in transit- 
amplifying (TA) matrix progenitors (Fig. 1b and Supplementary 
Fig. 1e). NFIB was not detected in melanocyte stem cells marked by 
dopachrome tautomerase (DCT) and tyrosine kinase receptor KIT in 
the upper ORS and bulge/hair germ, nor in differentiated melanocytes 
within the hair bulb’ (Fig. 1b, c). Overall, both inside and outside the 
niche, hair follicle stem cells and melanocyte stem cells showed synchro- 
nized behaviours but distinct expression patterns. 

To address the function of NFIB, we conditionally induced Nfib 
ablation (cKO) in mouse hair follicle stem cells by using Sox9-CreER 
and K15-CrePGR. Unless specified, data are from Sox9-CreER mice, but 
both gave similar results. NFIB immunofluorescence verified that Nfib 
was efficiently targeted in hair follicle stem cells, consistent with K15 
and Sox9 expression primarily in ORS/bulge/hair germ; furthermore, 
when Rosa26-lox-STOP-lox-YFP (RosaYFP; yellow fluorescent protein 
(YFP)) was used to mark and trace the progeny of Cre-induced cells, 
only hair follicle stem cells and their subsequent progeny, and not KIT+ 
melanocytes or dermal cells, were fluorescently labelled’® (Figs 1d, e and 
Supplementary Fig. 2a-c). Despite efficient targeting, however, Nfib-cKO 
hair coats and hair cycling appeared normal (Fig. 1f, g). 

As surprising as the absence of hair follicle stem cell lineage 
defects was the presence of melanocyte lineage abnormalities. Fontana-Masson 
staining revealed atypical melanogenic cells at the niche base of telogen 
hair follicles (Fig. 2a and Supplementary Fig. 2d). Immunostaining 
showed increases in melanocytes (KIT+DCT+) throughout hair germ 
and bulge, and ectopic presence of differentiated melanocytes 
(KIT+DCT+TYRP1+MITF+TYR+) within hair germ (Fig. 2b, c and 
Supplementary Fig. 3a-c). 

Figure 1 | Conditional Nfib targeting in hair follicle stem cells does not 
perturb hair cycle or follicle architecture. a-c, Immunofluorescence. 
a, Enrichment of nuclear NFIB in hair follicle stem cells and ORS of developing hair 
follicles. ECAD, E-cadherin; HFSC, hair follicle stem cells; Mx, matrix. b, NFIB in 
anagen hair follicles from adult Dct-EGFP BAC transgenic mice. NFIB is not seen in 
EGFP+ melanocytes. Ana, anagen; Bu, bulge; upORS, upper ORS. c, Absence of 
NFIB in KIT+ melanocyte stem cells of telogen hair follicles. DP, dermal papilla; HG, 
hair germ; Telo, telogen. d-f, Tamoxifen (TAM) was administered to Sox9-CreER/ 
Nfib^fl/fl/RosaYFP-cKO mice and analysed at various times thereafter. d, Note hair 
follicle stem-cell-specific Nfib targeting in bulge and hair germ. SG, sebaceous gland. 
e, Note YFP reporter activity in hair follicle stem cells but not in KIT+ melanocyte 
stem cells. f, Schematic. Cata, catagen; P20, postnatal day 20. g, Haematoxylin- and 
eosin-stained back skins reveal normal hair cycle and follicle architecture upon NFIB 
loss. Ctrl, control. Scale bars: 50 µm (P32, P38 in g); 25 µm (a, b, P74 in g); 10 µm (c-e). 

A priori, ectopic differentiated melanocytes in cKO hair germs 
might reflect hair bulb melanocytes that somehow escaped apoptosis 
during catagen. Alternatively, they could arise from precocious differ- 
entiation of niche melanocyte stem cells. To distinguish between these 
possibilities, we analysed melanocyte behaviour at each hair cycle stage 
(Fig. 2d and Supplementary Fig. 4). During anagen, TYRP1+ cells 
dropped to wild-type levels in the bulge/upper ORS. As dermal papilla 
returned to the niche during late catagen, TYRP1+ numbers again 
soared, persisting until the next anagen. By contrast, control stem cell 
niches displayed much more modest melanocyte stem cell differentiation, 
which occurred at telogen→anagen rather than catagen→telogen. 
The difference in this timing was >3 weeks for young adult mice. 

Accompanying the disappearance of differentiated melanocytes 
within the anagen Nfib-cKO bulge/upper ORS, was their appearance 
in the hair bulb (upper matrix) (Fig. 2e and Supplementary Fig. 4). 
Within the bulb, pigment transfer to differentiating hair cell recipients 
seemed normal. Additionally, at the start of catagen, TYRP1+ melanocytes 
underwent apoptosis and were eliminated by mid-catagen in 
both control and cKO hair bulbs. In contrast to differentiated melanocytes, 
TYRP1− melanocyte stem cells remained elevated throughout 
cKO hair cycles, where they resided in the bulge/upper ORS. The 
only time at which melanocyte stem cells dropped transiently was 
during late catagen/telogen, when melanocyte stem cells near dermal 
papilla precociously differentiated. 

These results pinpointed defects to the stem cell niche; furthermore, 
dermal papilla proximity seemed to affect primarily the uncoupling and 
premature differentiation of melanocyte stem cells rather than self- 
renewal (Fig. 2f). In addition, melanocyte stem cells in anagen stem- 
cell niches were still negative for TYRP1 even in 1-year-old Nfib-cKO 
hair follicles (Fig. 2d), indicating that ectopic differentiated melanocytes 
from telogen do not accumulate within the niche over hair cycles. 

To test melanocyte stem cell activity further, we mated Nfib-cKO to 
Dct-EGFP mice and isolated EGFP+ bulge cells from telogen hair 
follicles. Melanocyte differentiation markers, for example, Kit, Mitf, 
Tyrp1, Tyr and Sox10, were upregulated in EGFP+ cells, whereas genes 
expressed by both melanocyte stem cells and differentiated melano- 
cytes showed no change (Supplementary Fig. 3d). Finally, hair coats of 
ageing cKO mice were still pigmented, and melanocyte stem cell levels 
were sustained (Fig. 2e, g and Supplementary Fig. 3e). These data 
provided compelling evidence that NFIB-deficiency in hair follicle 
stem cells affects the timing of melanocyte differentiation without 
compromising melanocyte stem cell biology and/or function, a defect 
that would otherwise result in hair greying®”*"". 

Ultrastructural analysis unveiled new defects within the stem cell 
niche (Fig. 3a and Supplementary Fig. 5). Niche melanocytes closest to 
dermal papilla had pigment granules and immature melanosomes. 
Surprisingly, however, adjacent hair germ cells also had pigment gra- 
nules. Their epithelial identity was shown by keratin filaments, desmo- 
somes and hemidesmosomes. Although poorly understood, hair cell 
uptake of pigment from differentiated melanocytes is thought to 
depend on FOXN1”°, expressed by differentiating matrix cells but 
not by hair follicle stem cells’. 

Inappropriate accumulation of pigment proved calamitous for qui- 
escent hair follicle stem cells in the telogen hair germ: it elicited their 
apoptotic death, typified by condensed chromatin, mitochondrial 
destruction, and cleaved caspase-3-immunolabelling (Fig. 3a, b and 
Supplementary Fig. 6). In addition, unaffected neighbouring K5- 
positive hair germ cells proliferated (Fig. 3c, d and Supplementary 
Fig. 6). Although hair germ proliferation is a normal sign of 
telogen→anagen’, precocious hair cycle entry was not observed. 

Apoptotic and proliferative defects within the niche disappeared in 
anagen (Fig. 3b, d), concomitant with movement of differentiated 
melanocytes from niche to hair bulb (Fig. 2d-f). These results indicate 
that apoptosis and hyperproliferation of Nfib-null hair follicle stem 
cells depend upon precocious differentiation of neighbouring melano- 
cyte stem cells. They also indicate that when differentiated melanocytes 
inappropriately bequeath pigment to hair follicle stem cells rather than 
their customary differentiated progeny (hair cells), pigment-laden hair 
follicle stem cells are unable to cope, whereas healthy hair follicle stem 
cell neighbours proliferate to restock the niche. 

Notably, in older Nfib-cKO mice, black-pigmented dermal cells 
swarmed hair follicles and were even visible from the skin surface 
(Supplementary Fig. 7). Although dermal melanoblasts exist in certain 
skin regions such as the ear, and can differentiate under some conditions”, 
few TYRP1+ melanocytes were found in cKO dermis. Rather, a number of 
cells encompassing this pigment were Mac1+ with features of macrophages. 
Irrespective of whether pigment-laden vacuoles within dermal 
cells reflected engulfment of dying, pigmented cells, or direct pigment 
transfer from melanocytes, these defects made the normalcy of the hair 
coat of ageing cKO mice all the more remarkable. 
Figure 3 | Premature transfer of pigment promotes apoptotic cell death in 
hair follicle stem cells in the NFIB-deficient niche. a, Hair follicle 
ultrastructure. Pseudo-colouring highlights distinct stem cell niche cell types: 
green, hair follicle stem cell; red, melanocyte; blue, dermal papilla. Arrowheads 
mark cells shown at higher magnification. Open and solid arrows denote 
immature and mature melanosomes, respectively. Note pigment-laden 
apoptotic hair follicle stem cells whose boxed regions at higher magnification 
reveal condensed chromatin, degenerating mitochondrion, keratin filaments 
(Kf), desmosomes (De) and melanosomes. b, Activated caspase 3 antibody 
marks K5+ apoptotic cKO hair follicle stem cells, seen in telogen but not anagen 
(n = 3 mice; >30 hair follicles per mouse). HFs, hair follicles; SC, stem cell. 
c, d, Increased hair follicle stem cell proliferation in telogen but not anagen 
Nfib-cKO hair follicles. Assessment was by EdU (5-ethynyl-2’-deoxyuridine) 
incorporation (administered twice, 24 h before analysis) and Ki67 
immunostaining (n = 3 mice; >30 hair follicles per mouse). Scale bars: 10 µm 
(low magnifications in a and b-d); 2 µm (apoptotic hair follicle stem cell in 
a); 0.5 µm (cytoplasmic segments and melanocyte in a). Error bars indicate s.e.m. 
Figure 2 | NFIB loss enhances melanocyte stem-cell self-renewal and perturbs 
melanocyte stem-cell activity in the hair follicle stem cell niche. 
a, Ectopic pigmentation in telogen-phase cKO hair germ as detected by 
Fontana-Masson melanin staining and quantifications (n = 3 mice per 
experiment, >40 hair follicles per mouse). b-e, Immunofluorescence and 
quantifications. b, c, Increased melanocytes and ectopic differentiation in 
telogen cKO hair follicles (30-50 hair follicles; 2 mice per experiment). KIT 
marks melanocyte stem cells and differentiated melanocytes; MITF and TYRP1 
are differentiation-specific melanocyte markers. MCs, melanocytes. 
d, Precocious melanocyte differentiation begins in the niche at late catagen. 
Quantifications of TYRP1+ among KIT+ or Dct-EGFP+ melanocytes (>40 hair 
follicles; 2 mice per experiment). Analyses shown start at second telogen. SC, 
stem cell. e, Quantifications of melanocyte stem cells (TYRP1−) and 
differentiated melanocytes (TYRP1+) at different hair follicle stages (>30 hair 
follicles; 2 mice per experiment). f, Summary of data. McSC, melanocyte stem 
cell; MC, melanocyte. g, Lack of hair greying in ageing Nfib-cKO mice. 
Scale bars, 10 µm (a, b). *P < 0.001. Error bars indicate s.e.m. 
To dissect the molecular miscommunication between hair follicle 
stem cells and melanocyte stem cells in the niche, we used 
high-throughput RNA-seq to transcriptionally profile bulge and hair germ 
hair follicle stem cells. Fluorescence activated cell sorting (FACS) of 
skins from K15-CrePGR/RosaYFP/Nfib^fl/fl (cKO) and Nfib^fl/+ (het) 
mice was used for purifying CD34+ bulge and CD34− hair germ cells 
(both YFP+ Sca1− α6^high CD200+) (Supplementary Figs 2a and 8a). 

Of 800-1,000 messenger RNAs changed by ≥ twofold in NFIB-deficient 
hair follicle stem cells relative to control, 145 were upregulated 
and 99 were downregulated (Supplementary Figs 8b and 9 and Sup- 
plementary Table 1). Quantitative PCR (qPCR) of independently purified 
samples validated the differences (Supplementary Fig. 8c, d). Notably 
absent from the list were Foxn1, Fgf2, Nog, Egfr, F2rl1 and derivatives of 
Pomc—all previously implicated in melanocyte differentiation and/or 
pigment transfer'”’*!. Also absent were genes involved in BMP/TGF-β 
signalling, known to function in stem cell niche quiescence. Similarly, 
NFIB loss did not seem to affect canonical Wnt signalling, a key stimulus 
for stem cell activity and fate commitment: Wnt-sensitive target gene 
Axin2 was unchanged in NFIB-deficient hair follicle stem cells, and 
upregulated genes included both negative and positive Wnt regulators. 
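
As an illustration of the twofold cut-off described above, the following minimal Python sketch labels genes as up- or downregulated from a pair of normalized expression values. It is not the authors' pipeline: the function name, the pseudocount and the three placeholder genes with their values are assumptions made purely for the example.

```python
def classify_fold_change(control, knockout, threshold=2.0, pseudocount=1.0):
    """Label one gene 'up', 'down' or 'unchanged' from two expression values.

    The pseudocount is an assumption of this sketch; it avoids division by
    zero for genes with no signal in the control sample.
    """
    ratio = (knockout + pseudocount) / (control + pseudocount)
    if ratio >= threshold:
        return "up"
    if ratio <= 1.0 / threshold:
        return "down"
    return "unchanged"


# Invented placeholder values (control, cKO); these are not data from the study.
expression = {
    "gene_a": (10.0, 45.0),
    "gene_b": (80.0, 30.0),
    "gene_c": (50.0, 60.0),
}

calls = {
    gene: classify_fold_change(ctrl, cko)
    for gene, (ctrl, cko) in expression.items()
}
print(calls)
```

A symmetric threshold (ratio at least 2 or at most 0.5) treats upregulation and downregulation alike, which matches the "twofold up- or downregulated" wording used in the text.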

These results were consistent with the normal hair cycle displayed 
by Nfib-cKO skin (Fig. 1g). Had any of these signalling pathways been 
perturbed, both stem cell populations—not just melanocyte stem 
cells—should have been affected. Moreover, hair follicle stem cells 
from mice genetically defective for Wnt, BMP and TGF-β signalling 
still expressed NFIB protein and mRNA (Supplementary Fig. 10a, b). 
Thus at least within the quiescent stem cell niche, these signalling 
pathways seemed to be refractory to loss of NFIB, and NFIB seemed 
to be refractory to these signalling pathways. 

To identify direct NFIB target genes, we performed chromatin- 
immunoprecipitation with high-throughput-sequencing (ChIP-seq) 
analysis on 10-15 million bulge hair follicle stem cells FACS-purified 
from 15-20 mice’’. Applying immunoprecipitation-grade NFIB anti- 
body to chromatin, we identified 1,449 genes that were directly and 
reproducibly bound by NFIB (Fig. 4a and Supplementary Table 2). 
NFIB-bound genes included Krt5 (Supplementary Fig. 11a). Intri- 
guingly, like Krt5 and its transcription factor-AP2 family regulators, 
NFIB was absent in most areas of Trp63-null skin (Supplementary Fig. 
10c). Additionally, Nfib is bound by p63 (ref. 22) and harbours TFAP2 
binding motifs (data not shown), suggesting possible connections to 
these early stem cell markers. 
Figure 4 | RNA-seq and ChIP-seq analyses identify Edn2 as a direct NFIB- 
regulated gene mediating inter-stem cell crosstalk. a, Identification of direct 
NFIB-regulated genes as those that bind NFIB (from ChIP-seq) and are up/ 
downregulated in Nfib-cKO hair follicle stem cells (from RNA-seq). b, NFIB 
ChIP-seq profiles of Edn2 promoter. NFIB palindromic-binding-sequence 
exists within the peak. c, qPCR on FACS-isolated populations confirm RNA-seq 
results (Het, normalizations to heterozygote values). d, Immunofluorescence 
shows ectopic endothelins in Nfib-cKO hair follicles. e, Immunofluorescence 
reveals KITL in dermal papilla. f, Reductions in TYRP1+ melanocytes (>60 hair 
follicles; 3 mice) and apoptosis (n = 4 mice; >30 hair follicles per mouse) in cKO 
hair follicles after injecting anti-KIT blocking antibody (anti-KIT) during 


NFIB peaks were enriched in ± 2 kilobases (kb; 12%) and 2-50 kb 
(29%) sequences proximal to gene transcription initiation sites 
(Supplementary Fig. 11b, c). A de novo motif search identified five 
sequences within these peaks (Supplementary Fig. 11d, e). Most com- 
mon were TGGC*” and “/™GCCA, which when combined, com- 
prised a palindromic motif. Notably, NFIB protein self-dimerizes, 
and its preferred binding motif is a TTGGCANNNTGCCAA pal- 
indrome”*. Moreover, in response to NFIB loss, 201 (~ 14%) NFIB tar- 
get genes were differentially expressed relative to control hair follicle 
stem cells (Supplementary Fig. 12). Of these, 44% were upregulated 
whereas 56% were downregulated in NFIB-deficient hair follicle stem 
cells. NFIB’s lack of apparent bias for hair follicle stem cell gene activa- 
tion differed from the role of NFIC in cultured fibroblasts”’. 
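
The palindromic nature of the NFI site quoted above can be checked with a short Python sketch: a DNA palindrome equals its own reverse complement, so the two half-sites read the same on opposite strands. The helper names and the toy sequence are assumptions for illustration only, and the regular expression simply encodes TTGGCANNNTGCCAA with any base at the three N positions.

```python
import re

# Watson-Crick complements; N (any base) is its own complement.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A, C, G, T, N."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_palindromic(seq):
    """A DNA palindrome reads the same on both strands (5' to 3')."""
    return seq == reverse_complement(seq)


FULL_SITE = "TTGGCANNNTGCCAA"
SITE_REGEX = re.compile("TTGGCA[ACGT]{3}TGCCAA")


def find_sites(sequence):
    """Return start positions of non-overlapping full-site matches."""
    return [match.start() for match in SITE_REGEX.finditer(sequence)]


# Toy sequence invented for the example: flank + half-site + spacer + half-site + flank.
toy = "ACGT" + "TTGGCA" + "ACG" + "TGCCAA" + "GG"

print(is_palindromic(FULL_SITE))
print(reverse_complement("TGGC"))
print(find_sites(toy))
```

The same check explains why the half-sites pair up: the reverse complement of TGGC is GCCA, so two such half-sites separated by a spacer form a palindrome that a self-dimerizing factor can bind.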

Searching for candidates whose altered expression might enhance 
melanocyte stem cell proliferation and differentiation, we focused on 
the 33 NFIB-binding genes ≥ twofold up- or downregulated in bulge and 
hair germ hair follicle stem cells when Nfib is ablated (Fig. 4a). Notable 
was the gene Edn2, encoding endothelin-2. Within the Edn2 promoter 
was an NFI-binding palindrome sequence containing an optimal spacer 
(Fig. 4b). In both ChIP-seq replicates, NFIB bound to this site. 
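
The candidate selection described above amounts to intersecting two lists: genes bound by NFIB in ChIP-seq and genes changed at least twofold in RNA-seq. The Python sketch below shows that logic under stated assumptions; the function name, gene labels and fold-change values are invented placeholders rather than data from the study. The last line only re-computes the fraction reported in the text (201 of 1,449 bound genes).

```python
def direct_targets(bound_genes, fold_change, threshold=2.0):
    """Split ChIP-bound genes into upregulated and downregulated sets.

    fold_change maps gene name to the cKO/control expression ratio.
    """
    up, down = set(), set()
    for gene in bound_genes:
        ratio = fold_change.get(gene)
        if ratio is None:
            continue  # bound, but absent from this toy expression table
        if ratio >= threshold:
            up.add(gene)
        elif ratio <= 1.0 / threshold:
            down.add(gene)
    return up, down


# Placeholder inputs, not data from the study.
bound = {"gene_1", "gene_2", "gene_3", "gene_4"}
fold_change = {"gene_1": 3.1, "gene_2": 0.4, "gene_3": 1.2, "gene_5": 5.0}

up, down = direct_targets(bound, fold_change)
print(sorted(up), sorted(down))

# Fraction of bound genes that were differentially expressed, from the
# counts given in the text: 201 of 1,449 is roughly 14%.
reported_fraction = round(100 * 201 / 1449)
print(reported_fraction)
```

Note that gene_5 changes strongly but is excluded because it is not bound, which is the distinction the text draws between direct targets and indirectly affected genes.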

Endothelins are secreted factors with the ability to mediate intercel- 
lular crosstalk. All three endothelins stimulate cultured melanocyte stem 
cells’. Although Edn3 is required for neural crest migration and mela- 
nocyte specification during embryogenesis™, and Edn1 has been impli- 
cated in Wnt-mediated melanocyte proliferation’, neither Edn3 nor 
Edn1 appeared on our list of hair follicle stem cell genes bound by 
NFIB. Moreover, as assessed by RNA-seq and qPCR, Edn3 was not 
expressed in hair follicle stem cells, and Edn1 showed low expression 
and little change upon NFIB loss (Fig. 4c and Supplementary Table 1). By 
contrast, at telogen→anagen, Edn2 was transiently activated in wild-type 
hair follicle stem cells'®, and in Nfib-null hair follicle stem cells, Edn2 was 
upregulated more than twofold relative to controls. Pan anti-endothelin 
immunolabelling was also stronger in Nfib-null relative to control hair 
follicle stem cells (Fig. 4d). Finally, hair follicle stem cells express endothelin 
converting enzyme (ECE1) necessary to process/activate EDN2. 

The Edn2 changes were cell-autonomous, because Edn2 mRNA was 
also upregulated in cultured Nfib-null keratinocytes (Supplementary 


catagen/telogen. IgG, control. g, Anti-KIT-mediated inhibition of KIT signalling 
during anagen (>45 hair follicles in two mice). h, Overhead schematic: lentiviral 
constructs. Quantifications show increased KIT+ melanocytes (32 hair follicles; 
two mice per experiment) and melanin+ hair follicles (n = 3 mice; >60 hair 
follicles per mouse) upon EDN2 induction. Immunofluorescence shows TYRP1+ 
melanocyte differentiation in doubly- (Edn2) and not singly-transduced/ 
uninfected (Ctrl) hair follicle stem cells. i, Reduced TYRP1+ melanocytes (>46 
hair follicles in two mice) and apoptosis (n = 3 mice; >25 hair follicles per 
mouse) in Nfib-cKO hair follicles after injection of EDNRB inhibitor BQ788. 
Scale bars: 10 µm (d, e, h). ND, not determined; NS, not significant. Error bars 
indicate s.e.m. 


Fig. 13). Moreover, this difference depended upon NFIB, as it was res- 
cued by expressing the main keratinocyte isoform, NFIB3. By contrast, 
even though KIT ligand (Kitl) mRNA seemed to be modestly induced in 
Nfib-null hair follicle stem cells (Supplementary Table 1), the levels were 
not influenced by NFIB3-rescue (Supplementary Fig. 13), nor did we 
observe NFIB binding to the Kitl promoter/enhancer in ChIP-seq analyses. 
Interestingly, however, whereas KITL was not detected in hair 
follicle stem cells, it was seen in dermal papilla’’ (Fig. 4e). 

Together, these results indicated a model whereby Edn2 induction 
by Nfib-null hair follicle stem cells enhances proliferation of neigh- 
bouring melanocyte stem cells and sensitizes them to precociously 
differentiate when they encounter KITL and possibly additional sig- 
nals from dermal papilla. If true, then blocking KIT signalling should 
ameliorate precocious melanocyte stem cell differentiation in cKO hair 
follicles, whereas elevating EDN2 in wild-type hair follicle stem cells 
should generate phenotypic features of NFIB-deficiency. 

We first tested this hypothesis by injecting a KIT-receptor-blocking 
antibody into the skins of cKO mice beginning in late catagen. By 
telogen, marked reductions were seen in TYRP1+ melanocytes and 
in activated-CASP3+ hair follicle stem cells (Fig. 4f). Importantly and 
in agreement with previous findings’’”*, KIT inhibition in anagen hair 
follicles only affected lineage-committed proliferation and differenti- 
ation: undifferentiated melanocyte stem cells remained elevated in the 
cKO niche (Fig. 4g and Supplementary Fig. 14). 

We next induced EDN2 in wild-type adult skin by co-transducing 
E9.5 mouse embryos in utero with high-titre lentiviruses”* harbouring: 
(1) a tetracycline-regulatable transactivator (rtTA) coupled in a bicis- 
tronic transcript to EGFP; and (2) H2BmRFP1 and a tetracycline- 
inducible promoter driving either Edn2 (TRE-Edn2) or nothing 
(TRE-only) (Fig. 4h). Following selective EDN2 induction (Sup- 
plementary Fig. 15), rtTA/TRE-Edn2-transduced hair follicle stem 
cell niches contained increased KIT+ melanocytes, melanin+ cells
and TYRP1+ differentiated melanocytes (Fig. 4h). Notably, de novo
melanogenesis was not detected in similarly transduced epidermis, 
indicating that EDN2’s effects were confined to locations where pre- 
existing melanocyte stem cells reside (data not shown). Finally, 
premature melanocyte stem cell differentiation in Nfib-cKO niches 
depended upon EDN2, because marked reductions in TYRP1+ melanocytes
and activated-CASP3+ hair follicle stem cells were observed
when an endothelin receptor inhibitor (BQ788) was injected intrader- 
mally into cKO skin from late-catagen (Fig. 4i). 

In summary, our findings expose an unexpected gatekeeper, NFIB, 
which governs activity within the quiescent stem cell niche of hair fol- 
licles. Upregulation of a single NFIB target, Edn2, seemed to be sufficient 
to uncouple coordinated stem cell behaviour. In contrast to all known 
genetic pathways perturbing melanocyte stem cell function within hair 
follicles, precocious melanocyte differentiation and ensuing chaos was 
cyclical and did not compromise stem cell pools, hair pigmentation or 
growth, even in ageing animals. The mechanism underlying this unpre- 
cedented phenotype lends new importance to the two-tiered structure of 
the hair follicle stem cell niche, and to dermal papilla/KITL-independent 
and dependent steps in controlling the effects of EDN2. 

Endothelin 2 has not been implicated hitherto in normal cutaneous 
melanocyte physiology. Moreover, whereas melanocyte specification 
during embryogenesis fails to occur without Edn3 (ref. 27), melano- 
genesis is seemingly normal in Edn1 knockout mice’*. This has left it 
unclear as to whether endothelins function postnatally. Similarly to 
Edn3, elevating Edn2 in epidermis did not promote melanogenesis”; 
however, our results show that when provided with the proper micro- 
environment and additional stimulatory signals, endothelins can 
influence adult melanocyte stem cells and their differentiation. 

In closing, our findings add new understanding to how melanocyte 
stem cell and hair follicle stem cell behaviours maintain reliance upon 
cooperative factors within the niche. They also reveal how this com- 
munication might be selectively uncoupled in injury and disease states. 
Notably, Edn2 is induced upon ultraviolet irradiation and other stress 
conditions associated with increased pigmentation”*’. Although 
beyond the scope of the present study, the possible role of
NFIB in skin cancers, wound repair and stress responses merits investigation,
as does removing possible redundancy from other NFI members
expressed by hair follicle stem cells. In the future, it will be interesting to 
see the extent to which Nfib downregulation will tip the balance from 
coupled to uncoupled states in health and disease. Our studies here
emphasize the importance of endothelins as messengers that can
uncouple melanocyte and hair follicle stem cell synchrony.


METHODS SUMMARY 


Nfib fl/fl and Nfib−/− mice were engineered by R.M.G.’s laboratory. Dct-EGFP
animals were generated by GENSAT. Histology and immunofluorescence were
performed on cryosections (10–20 μm). For melanocyte markers, including DCT,
TYRP1 and TYR, sections were permeabilized with 0.3% H2O2 in cold methanol
for 30 min at −20 °C. Melanin staining was performed using a Fontana–Masson
stain kit (MarketLab). For NFIB ChIP-seq and RNA-seq analyses, hair follicle stem 
cells were isolated by FACS with indicated antibodies. For RNA extraction, cells 
were sorted directly into TRIzol LS Reagent (Life Technologies) and RNA was 
isolated using Direct-zol RNA MiniPrep (Zymo Research) with DNase treatment 
to remove residual genomic DNA. A detailed description of the ultrasound-guided 
lentiviral injection procedure and production of high-titre lentiviruses is provided
elsewhere’*’. Transduced mice were confirmed by genotyping with EGFP primers: 
forward 5’-AATGGCCACAAGTTCAGC-3’, reverse 5’-TCGCCGATGGGG 
GTATTCT-3’. Positive mice were fed with doxycycline-containing chow from 
P21. Figures were prepared using ImageJ, Adobe Photoshop and Illustrator CS3. 
Graphing and statistical analyses were performed using Prism 5 (GraphPad Software). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Mice. P52-60 CD1 mice from Charles River laboratories were used for NFIB 
ChIP-seq experiments. Nfib fl/fl and Nfib−/− mice have been described'®*", as have
transgenic K15-crePGR, knock-in Sox9-creERT2 and Rosa26!*STOP!"* YFP 
(RosaYFP) mice****, Dct-EGFP animals were generated by GENSAT project”. 
CreER was activated by intraperitoneal injection of P20-21 mice with 20 mg ml−1
tamoxifen (Sigma) in corn oil (Sigma) (1 mg per 10 g body weight per day).
Cre-PGR was induced by RU486 treatment: 5-day intraperitoneal injection of 10 mg ml−1
RU486 (VWR) in sesame oil (Sigma) (0.75 mg per 10 g body weight per day),
together with 2-week topical application with 4% RU486 in ethanol. 
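
The dosing statements above reduce to simple arithmetic, which can be sketched as follows. This is an illustrative calculation only: the 10 g body weight is a hypothetical example value, the function name is invented for this sketch, and the RU486 dose is read as 0.75 mg per 10 g body weight.

```python
def injection_volume_ml(body_weight_g, dose_mg_per_10g, stock_mg_per_ml):
    """Return the volume (ml) of stock solution that delivers the daily dose.

    dose_mg_per_10g: dose in mg per 10 g of body weight per day.
    stock_mg_per_ml: concentration of the injected stock solution.
    """
    dose_mg = (body_weight_g / 10.0) * dose_mg_per_10g
    return dose_mg / stock_mg_per_ml


# Hypothetical 10 g mouse (example value, not taken from the paper).
weight_g = 10.0

# Tamoxifen: 1 mg per 10 g per day from a 20 mg/ml stock.
tamoxifen_ml = injection_volume_ml(weight_g, 1.0, 20.0)

# RU486: 0.75 mg per 10 g per day from a 10 mg/ml stock.
ru486_ml = injection_volume_ml(weight_g, 0.75, 10.0)

print(f"tamoxifen: {tamoxifen_ml * 1000:.0f} ul, RU486: {ru486_ml * 1000:.0f} ul")
```

For the hypothetical 10 g animal this corresponds to 50 μl of the tamoxifen stock and 75 μl of the RU486 stock per day; the volume scales linearly with body weight.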

All animals were maintained in an AAALAC-approved Comparative Bioscience
Center (CBC) at The Rockefeller University and procedures were performed
using IACUC-approved protocols that adhere to the standards of the
National Institutes of Health.

Histology and immunofluorescence. Embryos (<E16.5) or back skins were 
embedded in OCT (Tissue Tek), frozen on dry ice and stored at −80 °C. Only
in the case of Rosa YFP reporter or lentiviral transduced mice, tissues were prefixed 
in 4% paraformaldehyde (PFA) for 30 min at room temperature before embedding 
in OCT in order to preserve the fluorescence signals. Un-prefixed frozen sections 
(10–20 μm) were fixed in 4% PFA for 10 min at room temperature. For histological
analysis, sections were stained with haematoxylin and eosin. Melanin staining was 
performed using a Fontana—Masson stain kit (MarketLab Inc.) according to the 
manufacturer’s directions. For immunofluorescence, sections were permeabilized 
in 0.3% Triton X-100 in PBS for 20 min and blocked for 1 h at room temperature in 
blocking buffer including 2.5% normal donkey serum, 2.5% normal goat serum (or 
5% normal donkey serum alone when goat primary antibodies were used), 0.5% 
BSA and 0.1% Triton X-100 in PBS. For melanocyte markers, including DCT, 
TYRP1 and TYR, sections were permeabilized with 0.3% H2O2 in cold methanol
for 30 min at −20 °C**. MOM Basic kit (Vector Laboratories) was used for blocking
when primary antibodies were generated from mouse. Primary antibodies were 
diluted in blocking buffer and sections were incubated overnight at 4°C. After 
washing with PBS for 30 min at room temperature, sections were incubated for 
1-2h at room temperature with secondary antibodies conjugated to Alexa-488, 
Alexa-546, Alexa-647 (Molecular Probes), or RRX (Jackson Laboratories). EdU 
staining was performed using Click-iT EdU Alexa Fluor 594 Imaging Kit (Life 
Technologies) and following manufacturer’s directions. Nuclei were stained using 
4′,6-diamidino-2-phenylindole (DAPI). Imaging was performed on a Zeiss
Axioplan 2, Zeiss Apotome, Zeiss Inverted LSM 510 laser scanning confocal 
microscope, or Zeiss Inverted LSM 780 laser scanning confocal microscope. 
Figures were prepared using ImageJ, Adobe Photoshop and Illustrator CS3. 

The following antibodies and dilutions were used: NFIB (rabbit, 1:1,000, Active 
Motif); KIT (rat, 1:1,000, BD Pharmingen); P-cadherin (goat, 1:100, R&D); 
E-cadherin (rat, 1:500, Fuchs laboratory); K5 (guinea-pig, 1:500, Fuchs labor- 
atory); CD34 (rat, 1:100, BD Pharmingen); Ki67 (rabbit, 1:300, Novocastra); 
cleaved caspase 3 (rabbit, 1:300, R&D); GFP (chicken, 1:2,000, Abcam); EDN
1/2/3 (rabbit, 1:100, Santa Cruz); KITL (rat, 1:300, R&D); Mac-1 (rat, 1:100,
eBioscience); MITF (mouse, 1:100, Abcam); DCT (rabbit, 1:500, gift from V. J.
Hearing); TYRP1 (rabbit, 1:1,000, gift from V. J. Hearing); TYR (rabbit, 1:1,000, 
gift from V. J. Hearing). 

Electron microscopy. Back skins were fixed in 2% glutaraldehyde, 4% PFA and 
2 mM CaCl2 in 0.05 M sodium cacodylate buffer, pH 7.2, at room temperature for
>1h, post-fixed in 1% osmium tetroxide, and processed for Epon embedding. 
Ultrathin sections (60-70 nm) were counterstained with uranyl acetate and lead 
citrate. Images were taken with a transmission electron microscope (Tecnai G2-12; 
FEI) equipped with a digital camera (model XR60; Advanced Microscopy 
Techniques). 

Isolation of hair follicle stem cells and FACS. Subcutaneous fat was removed 
from skins with a scalpel. To isolate bulge hair follicle stem cells for NFIB ChIP-seq 
experiment, skins were placed on 0.25% trypsin (GIBCO) at 37 °C for 30 min with 
dermis side down. For isolation of bulge and hair germ hair follicle stem cells, skins 
were first incubated in 0.25% collagenase (Sigma) in HBSS (GIBCO) at 37 °C for 
1h to remove dermis. After gentle PBS washing, skins were treated with trypsin as 
described above. Single-cell suspensions were obtained by scraping the skin
gently and filtering through 70-μm strainers, followed by 40-μm strainers.
Staining buffer (PBS with 5% FBS treated with BioRad Chelex to remove calcium) 
was added to inactivate Trypsin, and cells were collected by centrifugation for 
5 min at 300g. Cell suspensions were incubated with the appropriate antibodies 
diluted in staining buffer for 30 min at 4°C. The following antibodies were used: 
CD34-Alexa 660 (1:200, eBioscience); α6-phycoerythrin (1:500, eBioscience);
SCA1-Alexa 700 (1:200, eBioscience); CD200-biotin (1:200, AbD Serotec);
streptavidin-phycoerythrin-Cy7 (1:200, eBioscience). DAPI (100 ng ml−1) was used
for dead cell exclusion. Cell isolations were performed on BD FACSAria II sorters
equipped with BD FACSDiva software. For RNA extraction, cells were sorted
directly into TRIzol LS Reagent (Life Technologies). 

RNA extraction and RNA-seq analysis. Cells were isolated from Nfib fl/+/K15-crePGR/RosaYFP
(Het) and Nfib fl/fl/K15-crePGR/RosaYFP (cKO) mice. Nfib ablation
was induced from second telogen (P50-60) with RU486 treatment, and mice
were taken during third telogen (P120-130). Cells were sorted as described
above with antibodies: CD34-Alexa 660, CD200-biotin, α6-phycoerythrin, SCA1-Alexa 700
and streptavidin-phycoerythrin-Cy7. Cells were lysed with TRIzol LS
Reagent (Life Technologies) and RNA was extracted using Direct-zol RNA MiniPrep 
(Zymo Research) with DNase treatment to remove residual genomic DNA. Isolated 
RNA samples were provided to the Genomics Resources Core Facility at Weill 
Cornell Medical College for quality control, library preparation, clustering and sequen- 
cing. RNA-seq reads were aligned to the mouse genome (mm9, NCBI Build 37) using 
TopHat” (http://tophat.cbcb.umd.edu/). Cufflinks* (http://cufflinks.cbcb.umd.edu/) 
was subsequently used to assemble the aligned reads into transcripts and then estimate 
the transcript abundances. Pathway analysis was performed using iPAGE” algorithm, 
included in the ChIPseeger framework”. 

Quantitative real-time PCR with reverse transcription. RNA isolation was 
described above. Complementary DNA was synthesized from isolated RNA using 
SuperScript III First-Strand Synthesis System with oligo-dT primers (Invitrogen). 
cDNAs were mixed with indicated primers and Power SYBR Green PCR Master 
Mix (Applied Biosystems), and quantitative PCR (qPCR) was performed on an
Applied Biosystems 7900HT Fast Real-Time PCR system for 40 cycles. Specificity
was confirmed by subsequent melting curve analysis or gel electrophoresis.
Levels of PCR product were expressed as a function of peptidylprolyl isomerase B 
(Ppib). Primers were designed through Primer 3 and amplified products encom- 
passed exon/intron boundaries. The following primer sequences were used: Nfib 
forward 5’-ATGACCCATCCAGTCCTCAA-3’, reverse 5’-TTGAAGGAAAGG 
CTCTCCAA-3’; Dct forward 5'-AGAGAAACAACCCTTCCACAGA-3’, reverse
5'-CCAATGACCACTGAGAGAGTTG-3’; Kit forward 5'-GGGCTAGCCA 
GAGACATCAG-3’, reverse 5'-AGGAGAAGAGCTCCCAGAGG-3’'; Mitf for- 
ward 5'-TTGAAAACCGACAGAAGAAGC-3’, reverse 5'-TGGATGGGATA 
AGGGAAAGTC-3’; Tyrp1 forward 5’-GCCTCCAGTTACCAACACAGA-3’, 
reverse 5’-AACGGATCAGACAAGAAGCAA-3’'; Tyr forward 5'-CCAGAAG 
CCAATGCACCTAT-3’, reverse 5'-ATAACAGCTCCCACCAGTGC-3'; Sox10 
forward 5’-GACCAGTACCCTCACCTCCA-3’, reverse 5’-AGCCTCTCAGC 
CTCCTCAAT-3’; Ednrb forward 5'-CTCTGTTGGCTTCCCCTTC-3’, reverse 
5'-CGATTGGATTGATGCAGGA-3'; Pax3 forward 5'-GCGAGAAAAAGG 
CTAAACACA-3’, reverse 5’-CGGAGCCTTCATCTGACTG-3’; S100a1 forward
5'-CAAGGAAGGGGACAAATATAAGC-3’, reverse 5’-CGTTTTCATCCAG 
TTCCTTCA-3’; Edn1 forward 5'-TACTTCTGCCACCTGGACATC-3’, reverse 
5'-CCCTGAGTTCTTTTCCTGCTT-3’; Edn2 forward 5'-TTCTGCCATCGAA 
GACACTG-3’, reverse 5’-TGGCCTTTCTTGTCACCTCT-3’; Edn3 forward 5'- 
TGGGAAACAAGAGGACAAGG-3’, reverse 5'-CTGGGAGCTTTCTGGAAC 
TG-3'; Axin2 forward 5'-ACTGACCGACGATTCCATGT-3’, reverse 5’-CTGC 
GATGCATCTCTCTCTG-3’; Gpnmb forward 5'-TGCCTGCTGTCTGTGAG 
AAG-3’, reverse 5’-GGCAGTTTCCTATTGGCTTG-3’; Kitl forward 5’-CAAG 
TCTTACAAGGGCAGTTGA-3’, reverse 5’-ACAAGGTCACGGGTAGCAAG- 
3’; Chd3 forward 5'-ACTTTGATGAGCGTCCTGAAG-3’, reverse 5'-GGCTT 
GTCCTTCTCATTTCG-3’; Whrn forward 5'-TGGCTTATAGACCTGATGGA 
GAA-3’, reverse 5'-CTTCTGAGGGGGATTTGACAT-3’; Acsbg1 forward 5'- 
AAGTTCCTGTCCATGCTGCT-3’, reverse 5'-TGGAGAAGTCACGTTGGAG 
A-3'; Adamtsl4 forward 5'-GGCAACCAGACTCTCAGCTC-3’, reverse 5’- 
CGGCAGCAGGTAGTTGTGTA-3’; Cyp4b1 forward 5'-GCCTGATCTCTCTG 
CACATCTA-3’, reverse 5'-CACCTTCATCTCGTTCATAGCA-3’'; Gfral for- 
ward 5’-TGCTGGCCCTCTAGATCCATAAC-3’, reverse 5’-ACAGCGCTTCT 
GGCAGTTGATA-3’; Myoc forward 5’-TCGGCTTTAGAGGAAGAGAAGA- 
3’, reverse 5’-CATACTTGCCAGCGATTGTTT-3’; Nt5e forward 5'-ATGAA 
CATCCTGGGCTACGA-3’, reverse 5’-GTCCTTCCACACCGTTATCAA-3’; 
Steap4 forward 5'-TGATTCCTATCCGTTACTATGTTCG-3’, reverse 5’-ATG 
GGCTGTCTTTATTAGTTAGGG-3’; Tns1 forward 5'-TCTTACCATTGCC 
CTCAATCC-3’, reverse 5'-CCACGGACTCCACATAGCTC-3’; Ppib forward 
5'-GTGAGCGCTTCCCAGATGAGA-3’, reverse 5'-TGCCGGAGTCGACAAT 
GATG-3’. 
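
As an illustration of expressing a qPCR signal relative to Ppib, the sketch below uses the common ΔCt and ΔΔCt calculations. This is an assumption about one standard way to normalise to a reference gene, not a statement of the authors' exact procedure; it assumes 100% amplification efficiency, and all Ct values and function names are hypothetical.

```python
def relative_expression(ct_target, ct_reference):
    """Target level relative to the reference gene (2^-dCt).

    Assumes the product doubles every cycle (100% efficiency).
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)


def fold_change(ct_target_sample, ct_ref_sample, ct_target_control, ct_ref_control):
    """Fold change of a sample over a control (2^-ddCt)."""
    sample = relative_expression(ct_target_sample, ct_ref_sample)
    control = relative_expression(ct_target_control, ct_ref_control)
    return sample / control


# Hypothetical Ct values for a target gene and the Ppib reference.
sample_level = relative_expression(ct_target=24.0, ct_reference=20.0)
control_level = relative_expression(ct_target=26.0, ct_reference=20.0)
change = fold_change(24.0, 20.0, 26.0, 20.0)

print(sample_level, control_level, change)
```

With these made-up values the target crosses threshold two cycles earlier in the sample than in the control at equal Ppib signal, which corresponds to a fourfold difference under the stated efficiency assumption.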

NFIB ChIP-seq analysis. All materials and methods for ChIP-seq experiments have 
been described”. For each independent NFIB ChIP experiment, 10-15 million bulge 
hair follicle stem cells were isolated by FACS (CD34-Alexa 660 and α6-phycoerythrin
antibodies) from 15-20 CD-1 mice, and then used to isolate chromatin and incubated 
with anti-NFIB antibody (Active Motif) for ChIP-seq analysis. ChIP-seq reads were 
aligned to the mouse genome (mm9, build 37) with the Bowtie program” and the
Illumina Analyzer Pipeline. ChIP-seq peak calling, genomic annotation of NFIB peaks, 
and comparison between NFIB targets and gene expression signature genes were 
performed using ChIPseeger*’. Motif analysis was performed using the FIRE” algorithm,
included in the ChIPseeger framework. 

Plasmid DNA constructions. Plasmids encoding full-length Nfib1 cDNA were 
obtained from ATCC (clone MGC-13959). Since NFIB3 is the major isoform 
present in epidermal keratinocytes, Nfib3 cDNA was used in this study. Nfib3 
DNA fragment was amplified from Nfib1 cDNA clone by PCR with primers:
forward 5'-GITTGCGAGCTCTCATGATGTATTCTC-3’, reverse 5’-GTCAACC 
CGGGCTAGCCCAGGTACCAGGACTGGCTCGTTTGAGGA-3’. 

For expression of EGFP-NFIB3, Nfib3 DNA fragment was then inserted into
pEGFP-C1 between SacI and XmaI restriction sites. All lentiviral vectors (LV) were
present in the pLKO lentiviral backbone (pLKO-no-stuffer-PGK-MCS, Addgene
10879). For construction of LV-rtTA-IRES-EGFP, DNA fragment of rtTA-IRES-EGFP
was excised from pMSCV-rtTA2S-M2-IRES-EGFP with BglII and SalI
restriction enzymes (RE) and cloned between BamHI and SalI sites of pLKO
backbone. rtTA2S-M2 (Tet-On Advanced) was cloned from pUHD-rtTA2S-M2.
This was excised by digesting with BamHI and SacII, and blunting with Klenow.
The blunted fragment was subcloned into pMSCV-IRES-EGFP digested with
HpaI, and dephosphorylated with SAP. For construction of LV-TRE-PGK-H2BmRFP1,
TRE promoter was first inserted into ClaI and AgeI sites of pLKO
vector to replace U6 promoter (pLKO-TRE-PGK). The TRE promoter from
Clontech’s pTRE2 was amplified by PCR using primers flanked with ClaI and
AgeI sites: forward 5'-CGTATATCGATGCCCTTTCGTCTCGA-3', reverse
5'-GAATTACCGGTCGCGGAGGCTGGAT-3'.

H2BmRFP1 DNA fragment was obtained from pCR-H2BmRFP1 by BglII and
NsiI restriction enzyme digestion’®, which was then inserted between BamHI and
NsiI sites of pLKO-TRE-PGK. Edn2 cDNA was amplified from keratinocyte
cDNA library with primers flanked with EcoRI site: forward
5'-CGCCAGAATTCATGGTCTCCGCCTGGT-3', reverse
5'-CGCCAGAATTCGGTGTTATCTCTTCCTCCATC-3'.

Afterwards, DNA fragment was inserted into EcoRI site of LV-H2BmRFP1-TRE 
vector. All insertions were verified by DNA sequencing. 

A detailed description of the ultrasound-guided lentiviral injection procedure 
and production of high-titre lentiviruses is provided elsewhere*’. Transduced
mice were confirmed by genotyping with EGFP primers: forward 5'-AATGG 
CCACAAGTTCAGC-3’, reverse 5'-TCGCCGATGGGGGTATTCT-3’. Positive 
mice were fed with doxycycline-containing chow from P21. 

Cell culture and in vitro Nfib overexpression. Primary Nfib-null (KO) keratinocytes
were isolated from epidermis of E18.5 Nfib knockout embryos (Nfib−/−)
and control keratinocytes were from heterozygous Nfib+/− embryos. Keratinocytes
were maintained in E-media supplemented with 15% serum and a final
concentration of 0.05 mM Ca2+. Experiments were performed using primary cells
with fewer than 15 passages. Plasmids expressing EGFP or EGFP-NFIB3 were
introduced into control and KO keratinocytes with FuGENE 6 Transfection 
Reagent (Roche). After 24h transfection, EGFP-positive cells were isolated by 
FACS for qPCR analysis. 

Intradermal injection of EDNRB receptor inhibitor BQ788 and anti-KIT 
blocking antibody. To inhibit EDNRB receptor from late catagen, 100 μl
BQ788 dissolved in PBS (1 mg ml−1, Tocris Bioscience) or PBS alone was injected
intradermally into the back skin of mice on first catagen when skin colour turned
from black to pink’. To inhibit KIT signalling, 100 μl anti-KIT antibody (BD
Pharmingen, 1.0 mg ml−1, clone: ACK45 NA/LE) or control Rat IgG was used”.
Injections were conducted every other day for 3 days (day 0, 2, 4), and mice were
taken and analysed on day 5. For inhibition of KIT signalling during anagen, 100 μl
anti-KIT antibody or control IgG was injected intradermally into back skin 1 day 
after depilation. Injections were conducted every other day for 5 days (day 0, 2, 4, 6, 
8) and mice were taken and analysed on day 10. 

Statistics. To determine significance between two groups indicated in figures, 
comparisons were performed in Prism 5 (GraphPad Software) with unpaired 
two-tailed Student’s t-test.
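
For readers who want to see what an unpaired Student's t-test and the s.e.m. error bars compute, a minimal standard-library sketch follows. The analyses in the paper were run in Prism 5; this is not that software's code, the measurements are invented example values, and the equal-variance (pooled) form of the test is assumed.

```python
import math
import statistics


def sem(values):
    """Standard error of the mean: sample s.d. divided by sqrt(n)."""
    return statistics.stdev(values) / math.sqrt(len(values))


def unpaired_t(a, b):
    """Student's t statistic and degrees of freedom for two independent
    groups, using the pooled (equal-variance) estimate."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / df
    std_err = math.sqrt(pooled_var * (1.0 / na + 1.0 / nb))
    t = (statistics.mean(a) - statistics.mean(b)) / std_err
    return t, df


# Invented example measurements for two groups.
group_a = [1.0, 2.0, 3.0]
group_b = [4.0, 5.0, 6.0]

t_stat, dof = unpaired_t(group_a, group_b)
print(f"t = {t_stat:.4f}, df = {dof}, s.e.m.(a) = {sem(group_a):.4f}")
```

The two-tailed P value is then obtained from the t distribution with the returned degrees of freedom; a statistics package such as SciPy (`scipy.stats.ttest_ind`) performs both steps in one call.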
‘See-saw’ expression of microRNA-198 and FSTL1
from a single transcript in wound healing 


Gopinath M. Sundaram, John E. A. Common, Felicia E. Gopal, Satyanarayana Srikanta, Krishnaswamy Lakshman,
Declan P. Lunny, Thiam C. Lim, Vivek Tanavde, E. Birgitte Lane & Prabha Sampath


Post-transcriptional switches are flexible effectors of dynamic changes 
in gene expression’. Here we report a new post-transcriptional switch 
that dictates the spatiotemporal and mutually exclusive expression of 
two alternative gene products from a single transcript. Expression of 
primate-specific exonic microRNA-198 (miR-198)’, located in the 3’- 
untranslated region of follistatin-like 1 (FSTL1)? messenger RNA, 
switches to expression of the linked open reading frame of FSTL1 
upon wounding in a human ex vivo organ culture system. We show 
that binding of a KH-type splicing regulatory protein (KSRP, also 
known as KHSRP) to the primary transcript determines the fate of 
the transcript and is essential for the processing of miR-198: transforming
growth factor-β signalling switches off miR-198 expression
by downregulating KSRP, and promotes FSTL1 protein expression.
We also show that FSTL1 expression promotes keratinocyte migration,
whereas miR-198 expression has the opposite effect by targeting
and inhibiting DIAPH1, PLAU and LAMC2. A clear inverse correlation
between the expression pattern of FSTL1 (pro-migratory) and
miR-198 (anti-migratory) highlights the importance of this regula- 
tory switch in controlling context-specific gene expression to orches- 
trate wound re-epithelialization. The deleterious effect of failure of 
this switch is apparent in non-healing chronic diabetic ulcers, in which 
expression of miR-198 persists, FSTL1 is absent, and keratinocyte 
migration, re-epithelialization and wound healing all fail to occur. 
Wound healing requires the regulated integration of complex biological
events including cell migration, proliferation and extracellular
matrix remodelling’, globally stimulated by transforming growth
factor-β (TGF-β) and other growth factors’. Gene expression in
this regenerative process is extensively regulated by non-coding 
microRNAs (miRNAs)°. We compared miRNA-expression profiling 
from cutaneous wounds, using a human skin ex vivo organ culture 
system sampled at 0 h and 24 h after injury, and found miR-198 to be a
consistent and significant differentially expressed miRNA. Primate- 
specific miR-198 belongs to a small cohort of human exonic miRNA 
stem-loops (Fig. 1a and Supplementary Fig. 1) and is located within
the eleventh exon of the protein-coding FSTL1 (ref. 3) gene. At steady 
state, healthy epidermis expresses high levels of miR-198 (ref. 7), but 
upon injury, miR-198 levels plummet as early as 3h after wounding 
and remain low for at least 24 h (Fig. 1b). In situ hybridization reveals 
expression of mature miR-198 in epidermal keratinocytes in normal 
skin, but not in epidermis 24h after injury (Fig. 1c). 

Because miR-198 lies within an exon of the FSTL1 gene, we investigated
whether miR-198 and FSTL1 were being co-regulated. In contrast
to miR-198, FSTL1 mRNA and protein were observed in the
wound-edge epidermis 24 h after injury, but were not seen in unwounded 
epidermis (Fig. 1d, e and Supplementary Fig. 2). A reducing gra- 
dient of miR-198 intensity, and a corresponding appearance of 
FSTL1 protein, was observed from the wound edge out to about 
12 mm (Supplementary Fig. 3). A single transcript can therefore func- 
tion either as a primary miRNA (pri-miRNA) transcript making 
miR-198 or as an mRNA® producing FSTL1 protein, suggesting a
post-transcriptional switch regulating context-specific expression of
these diverse products of a single gene.

Figure 1 | Expression of an exonic miRNA or linked open reading frame in
context-specific physiological states. a, Schematic diagram of the FSTL1 gene
on chromosome (chr) 3 (NM_007085), showing the exon-intron boundaries
and the 3’-UTR sequence encoding miR-198 (red indicates mature miR-198
sequence and green indicates the GUG motif in the terminal loop of the
precursor miR-198 (pre-miR-198) sequence). b, miR-198 expression by qRT-PCR
in organ culture explants at indicated time points after injury. Student’s
t-test was used to calculate the P value and error bars denote ± s.e.m.
**P < 0.001. c, In situ hybridization with locked nucleic acid (LNA) probes
specific for mature miR-198 on normal skin at 0 h (left) and 24 h (right) after
injury (n = 5). miR-198 is stained red and nuclei are stained blue.
d, Representative semiquantitative RT-PCR analysis of FSTL1 mRNA in
explant samples before and after injury at indicated time points. Ribosomal
large subunit P0 (RPLP0) was used as a control. e, Immunohistochemical
localization of FSTL1 protein at 0 h (left) and 24 h (right) after wounding
(n = 5). Scale bars, 100 μm.

There was no significant change in expression of the precursor FSTL1 
mRNA (pre-FSTL1 mRNA) synthesized from the DNA template after 
transcription, confirming that regulation was post-transcriptional 
(Supplementary Fig. 4a, b). Detection of low levels of FSTL1 mRNA 
at 0 h after injury (Supplementary Fig. 4c), with transcript localization 
in the nucleus and absence of FSTL1 protein, indicated that FSTL1
mRNA functions as a pri-miRNA transcript and is processed to 
form mature miR-198 (Supplementary Fig. 4d). Yet at 24h after 
injury, the FSTL1 transcript is functioning as an mRNA, as confirmed 
by the abundant FSTL1 protein (Supplementary Fig. 4d). The pres- 
ence of the pre-miR-198 in the 3’-untranslated region (UTR) of FSTL1 
mRNA does not inhibit FSTL1 protein expression, as confirmed by 
chimaeric luciferase assays (Supplementary Figs 4e, f). In conclusion, 
if the FSTL1 transcript is not processed to miR-198, the full-length 
transcript is exported to the cytoplasm and functions as FSTL1 
mRNA. 

FSTL1, a member of the BM-40/SPARC/osteonectin family, is a 
glycoprotein originally cloned from an osteoblastic cell line as a 
TGF-β1-induced gene*”. Expression of FSTL1 protein in the wound-edge
epidermis (Fig. 1e) prompted us to study the role of FSTL1 in
wound healing. Keratinocytes transfected with small interfering RNA 
(siRNA) against FSTL1 or control non-targeting siRNAs were grown 
to confluent monolayers and subjected to scratch-wounding. Knockdown
of FSTL1 (Supplementary Fig. 5a, b) significantly suppressed
migration of keratinocytes (Supplementary Fig. 5c, d), demonstrating 
a new role for FSTL1 in promoting keratinocyte migration. Knock- 
down of FSTL1 did not affect miR-198 expression (Supplementary 
Fig. 5e). Comparison of keratinocyte gene-expression profiles in the 
presence and absence of FSTL1 revealed differentially expressed genes 
to include CXCL10 (ref. 10) and FERMT2 (ref. 11), both essential for 
migration, suggesting pathways by which FSTL1 may increase keratinocyte
migration (Supplementary Fig. 5f).

By contrast, miR-198 expression in steady-state epidermis, and 
its downregulation in activated migrating epidermal keratinocytes’ 
at the wound edge, suggests a function for miR-198 in inhibiting 
keratinocyte migration. Scratch-wound assays were therefore used to 
examine migration in keratinocyte monolayers overexpressing miR-198
(Supplementary Fig. 6a): these cells only closed 35 ± 10% of the
scratch-wound area in 24 h, without affecting proliferation, compared
to complete closure observed in controls (Supplementary Fig. 6b-d 
and Supplementary Videos 1 and 2). Although FSTL1 mRNA itself has 
a binding site for miR-198, overexpression of miR-198 did not affect 
FSTL1 expression (Supplementary Fig. 6e), as verified by luciferase
assays (Supplementary Fig. 6f). Together, these results suggest that 
miR-198 inhibits keratinocyte migration independently of FSTL1. 

We then sought to trace the miR-198 inhibitory regulatory network 
and identify specific regulators of keratinocyte migration by compar- 
ing cultured keratinocyte gene-expression profiles in the presence or 
absence of miR-198. Representation of selected array data as a heat- 
map revealed potential targets of miR-198, which were validated by 
quantitative reverse-transcriptase PCR (qRT-PCR) (Supplementary 
Fig. 6g, h). Further targets of miR-198 were analysed in a human skin 
organ culture model. At steady state, when epidermal keratinocytes 
express high levels of miR-198, these candidate target genes are down- 
regulated. Upon injury, expression of these target genes rises as 
miR-198 expression drops, supporting their potential regulation by 
miR-198 (Fig. 2a). Interrogation of the TargetScan v5.1 database and 
correlation with microarray data revealed direct targets, that is with at 
least one binding site each for miR-198, including urokinase-type 
plasminogen activator (PLAU), a serine protease that degrades extra- 
cellular matrix’; diaphanous homologue 1 (DIAPH1), involved in 
actin polymerization’; and laminin γ2 chain (LAMC2), an essential
component of the basement membrane protein laminin 332 (ref. 15). 
Significant downregulation of luciferase reporter activity with co- 
transfection of miR-198 and wild-type PLAU, DIAPH1 and LAMC2
3'-UTRs, but not with mutant 3’-UTRs, confirmed these are direct 
targets of miR-198 (Fig. 2b and Supplementary Fig. 7). Knockdown of 
DIAPH1, PLAU and LAMC2 using gene-specific siRNA (Supplementary
Fig. 8a-f) significantly suppressed migration of keratinocytes,
which only closed 46 ± 16%, 59 ± 8% and 40 ± 6% of the scratch-wound
area in 24 h, respectively, compared to complete wound closure seen in
control-transfected keratinocytes, thus phenocopying the effect of
miR-198 (Supplementary Fig. 8g-j). Although we observed a moderate
rescue of the effect of miR-198 with ectopic overexpression of
individual targets, when taken in combination the rescue is highly 
significant (Supplementary Fig. 9). 

Figure 2 | The regulatory switch is impaired in chronic wounds. a, Heat-map
of selected genes (putative targets) generated from microarray using RNA
from skin (organ culture) at 0 and 24 h after injury. Expression values are
displayed in shades of red (low) or blue (high) relative to the individual mean
value of the gene in a linear scale. b, Validation of direct targets of miR-198 by
luciferase reporter assays. Cells were co-transfected with wild-type (WT) or
mutant (Mut) 3’-UTR luciferase reporter constructs as indicated, and either
with miR-198 or non-targeting scrambled control. Normalized relative
luciferase activities are shown as a bar diagram. Luciferase activity is expressed
as mean relative to controls (n = 3). **P < 0.001. Student’s t-test was used to
calculate P value and error bars denote mean ± s.e.m. c, Immunohistochemical
analysis of PLAU, LAMC2 and DIAPH1, on sections from organ culture and
chronic wounds. A substantial increase in protein expression of target genes is
clearly observed at 24 h after injury (middle panels). However, in chronic
wounds (right panels), the expression of target genes remains relatively low
(n = 8). d, Expression of FSTL1 protein and miR-198 detected by
immunohistochemistry (left panels) and in situ hybridization (right panels)
respectively, in normal wounds 24 h after injury or in chronic wounds (n = 8).
Scale bars, 100 μm.

Protein expression from these target genes was tested by immunohistochemistry
in normal healthy skin. A substantial increase in
expression of these proteins was observed 24 h after wounding, validating
the mRNA-expression profiles (Fig. 2c, left and middle panels).
PLAU, LAMC2 and DIAPH1 proteins were detected in epidermis at 
the wound edge, but not in unwounded epidermis (Fig. 2c). Thus we 
conclude that miR-198 restrains keratinocyte migration in steady-state 
epidermis by suppressing genes that are important for migration. 

We then investigated expression of these targets in chronic non- 
healing ulcer wounds from patients with diabetes mellitus, in which 
wound healing is defective, and observed lower expression of PLAU, 
DIAPH1 and LAMC2 (Fig. 2c, right panel) than in healing wounds. 
Persistent high levels of miR-198 were detected at the chronic-wound 
edge (n = 8), suggesting an explanation for the absence or low levels of 
its target gene products (Fig. 2d). Furthermore, the absence of FSTL1 
protein at the wound edge suggests that the FSTL1-miR-198 switch 
described above is non-functional in chronic wounds (Fig. 2d and 
Supplementary Fig. 10). 

In an attempt to understand the regulation of the post-transcrip- 
tional switch that directs the transcript to function as a pri-miRNA or 
an mRNA, we analysed the sequence of the pre-miR-198 in the 3’- 
UTR of the FSTL1 transcript. The presence of a GUG motif (Fig. 1a) 
within the terminal loop suggests that miR-198 belongs to a cohort of 
miRNAs processed by KSRP that binds to the specific G-rich motif"®. 
RNA-binding-protein immunoprecipitation (RIP) (Fig. 3a and Supplementary 
Fig. 11a) and RNA gel retardation assays (Fig. 3b and 
Supplementary Fig. 11b) confirmed binding of KSRP to the GUG 
motif of pre-miR-198 in the 3’-UTR of FSTL1 mRNA in normal 
epidermal keratinocytes. Formation of an RNA-protein complex with 
the wild-type probe (GUG motif), but not with a mutant probe (CUC 
motif), confirms the specificity of binding. Addition of recombinant 
KSRP to an in vitro miRNA processing assay yielded a significant 
increase in mature miR-198 with wild-type (GUG motif) but not with 
mutant (CUC motif) sequences, confirming the role of KSRP in miR- 
198 processing (Supplementary Fig. 12a, b). 

In loss-of-function assays using a gene-specific siRNA against 
KSRP, detection of low levels of mature miR-198 with a corresponding 
increase in FSTL1 mRNA supports the essential role of KSRP in 
switching on miR-198 processing (Fig. 3c). This suggests that the mere 
presence of a miR-198 precursor stem-loop in cis cannot efficiently 
induce nuclear miR-198 processing. Adding another layer of complexity, 
we demonstrate that processing of exonic miR-198 is dependent 
on a trans-acting RNA-binding protein, KSRP”®. In the absence of 
KSRP, miR-198 processing fails and FSTL1 transcript now functions 
as an mRNA, resulting in expression of the FSTL1 protein. 

TGF-β1 (but not FGF2 or PDGF-AB) can regulate this switch, 
suppressing miR-198 generation by downregulating KSRP production, 
to indirectly promote FSTL1 expression (Fig. 3d and Supplementary 
Fig. 13a–d). The increase in transcript abundance of miR-198 targets 
and FSTL1 is independent of de novo transcription, as was confirmed 
by actinomycin-D inhibition of transcription (Supplementary 
Figs 14a, b and 15a, b). Thus TGF-β1 signalling promotes 
transcript stability of FSTL1 and miR-198 targets. 

Blocking the processing of miR-198 using siRNA against DROSHA 
leads to an increase in transcript abundance of miR-198 targets and 
FSTL1 (Supplementary Figs 14c and 15c, d). Functional inhibition of 
miR-198 with anti-miR-198 results in increased protein expression of 
targets (Supplementary Fig. 14d). However, no change in FSTL1 
expression with anti-miR-198 (Supplementary Fig. 15e) confirms that 
FSTL1 is not a target of miR-198. In summary, TGF-β1 promotes the 
stability of FSTL1 by downregulating KSRP, which is essential for miR- 
198 processing, providing a mechanism for post-transcriptional regu- 
lation of FSTL1 and miR-198-target mRNAs. 

The KSRP 3'-UTR has a highly conserved binding site for miR-181a'’, 
a TGF-β1-induced miRNA (Supplementary Fig. 16a). A concomitant 
downregulation of KSRP suggests a link between TGF-β1 
and KSRP mediated by miR-181a. Detection of miR-181a as early as 
3 h after injury in explant wound assays suggested that repression of 
KSRP by TGF-β1 is potentially mediated by miR-181a (Supplementary 
Fig. 16b). Significant downregulation of luciferase activity upon 
co-transfection with miR-181a and wild-type (but not mutant) KSRP 
3'-UTR confirmed that KSRP is a direct target of miR-181a (Supplementary 
Fig. 16c). This link between KSRP and TGF-β1 is further 
supported by the detection of KSRP in normal epidermal keratinocytes 
in the absence of TGF-β1 (Fig. 3e, upper panel). However, upon injury 
TGF-β1 downregulates KSRP (Fig. 3e, middle panel), thus facilitating 
miR-198 or FSTL1. a, RNA immunoprecipitation from keratinocyte lysates 
using anti-KSRP antibody or IgG control. RT-PCR showing fold enrichment 
of transcripts reveals specific binding of KSRP protein to FSTL1 mRNA but not 
to the most abundant mRNA species, KRT14 (n = 3). b, RNA gel retardation 
assay showing binding of KSRP to the GUG motif in the loop of the pre-miR-198 
transcript and abrogation of binding with mutant CUC motif in this site 
(Mut-1). Mutation of the GG motif in the stem of the pre-miR-198 transcript to 
CC results only in a modest loss of binding (Mut-2). RNA-protein complex was 
c, Histogram representing relative transcript abundance in keratinocytes 
transfected with control or gene-specific siRNA against KSRP (n = 3). 
d, Histogram representing relative transcript abundance in keratinocytes 
treated with TGF-β1 or control (n = 3). Student’s t-test was used to calculate P 
value. Error bars denote mean ± s.e.m. **P < 0.001. e, Immunohistochemistry 
with TGF-β1 (left panel) and KSRP antibody (right panel) on normal skin 
explants and chronic-wound sections (n = 5). Scale bars, 100 µm. 
the expression of FSTL1. This suggests that the observed low levels of 
TGF-β1 and absence of TGF-β1 receptors in chronic wounds'*"? may 
directly contribute to constitutive expression of miR-198. In support of 
this argument, we observe higher expression of KSRP and low levels of 
TGF-β1 in chronic wounds (Fig. 3e, lower panel). Thus the dysfunctional 
FSTL1-miR-198 switch that is the potential cause of impaired 
keratinocyte migration and loss of re-epithelialization culminating 
in non-healing chronic wounds may be a consequence of defective 
TGF-β signalling. 

Keratinocyte migration and re-epithelialization are essential for 
wound healing, when rapid wound closure is critical to restore barrier 
function and integrity of skin*’*'. Data presented here describe a regu- 
latory molecular switch that directly controls keratinocyte migration. 
At homeostasis a cohort of epidermal genes required for keratinocyte 
migration are kept suppressed by miR-198. Inappropriate expression 
of miR-198 may not only restrict migration” but could also contribute 
to decreased fibrinolysis and impaired matrix deposition in chronic 
diabetic ulcers”**** by inhibiting the expression of PLAU and LAMC2. 
At the non-healing edge of diabetic ulcers, keratinocytes proliferate**”® 
but fail to migrate. Our data suggest that failure to switch off miR-198 
and absence of FSTL1 result in failure of these wounds to heal: by 
inhibiting multiple genes involved in various facets of keratinocyte 
migration, miR-198 may effectively prevent cell migration and re- 
epithelialization in chronic wounds (Supplementary Fig. 17). 

The FSTL1-miR-198 ‘see-saw’ seems to be a unique regulatory 
switch critical for wound healing, and in its elucidation we identify 
miR-198 as a potential molecular biomarker for non-healing wounds. 
Chronic wounds in patients with diabetes mellitus are a major global 
health burden and the most common cause of lower extremity ampu- 
tations’’”’*. Targeting miR-198 and modulation of the defective 
switch in wound-healing disorders pose exciting possibilities as com- 
binatorial topical therapeutics to improve patient outcomes. 


METHODS SUMMARY 
Human skin and ex vivo organ culture and injury assay. Ex vivo injury assays 
were carried out using a human skin organ culture model system as described 
previously’’. Tissue samples were processed for histology or RNA analysis at the 
indicated time points after injury. For immunohistochemistry, tissue samples were 
fixed before dehydration and embedded in paraffin blocks. Skin biopsies were 
excised from the margins of chronic non-healing ulcers of diabetic patients, before 
scheduled amputations. Samples were processed as above for immunohisto- 
chemistry. Chronic-wound biopsy samples were collected from patients (n = 8) 
between the ages of 45 and 55 years old. 

This study was approved by the local, domain-specific ethical review board in 
accordance with the Declaration of Helsinki. All participants gave written, 
informed consent. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Human skin and ex vivo organ culture and injury assay. Ex vivo injury assays 
were carried out using a human skin organ culture model system as described 
previously’’. Skin biopsies were excised from the margins of chronic non-healing 
ulcers of diabetic patients, before scheduled amputations. Chronic-wound biopsy 
samples were collected from patients (n = 8) between the ages of 45 and 55 years old. 
Cell culture. N/TERT-1 keratinocytes derived from normal human epidermal 
keratinocytes and immortalized with the telomerase catalytic subunit were used 
for experiments. Cells were cultured in keratinocyte serum-free medium (Life 
Technologies) supplemented with 0.2 ng ml⁻¹ of epidermal growth factor (EGF), 
25 µg ml⁻¹ of bovine pituitary extract, 0.4 mM CaCl₂ and penicillin/streptomycin. 
Cell cultures were maintained at low confluence to prevent differentiation and 
passaged before they reached 40% confluence. 

Lenti-X 293T cells (Clontech) were cultured in DMEM with 10% fetal calf 
serum containing penicillin/streptomycin. 
Antibodies. Antibodies used in this study are as follows: goat anti-FSTL1 antibody 
(Abcam), rabbit anti-DIAPH1 antibody (Cell Signaling), mouse anti-LAMC2 
antibody (Santa Cruz Biotechnology), rabbit anti-PLAU (Abcam), mouse anti- 
TGF-B1 (Novocastra), rabbit anti-KSRP antibody (Bethyl Laboratories) and 
chicken anti-goat Alexa 488 (Molecular Probes, Invitrogen). 
RNA isolation from skin. Total RNA from skin samples was isolated by a two- 
step RNA-isolation method. Skin biopsies were homogenized in TRIzol and RNA 
was extracted as per the manufacturer’s protocol (Life Technologies). Precipitated 
RNA was further subjected to column purification (Exiqon) and subjected to 
quality-control analysis by NanoDrop (Thermo Scientific) and Bioanalyzer 
(Agilent Technologies) to assess the concentration and integrity of isolated RNA. 
Quantification of miRNA and mRNA. Total RNA (50 ng) was reverse tran- 
scribed using miRNA-specific primers (Applied Biosystems, Life Technologies) 
according to TaqMan miRNA Reverse Transcription Kit (Life Technologies). 
Real-time analysis of miRNA expression was carried out using TaqMan probes. 
Cycle threshold (CT) values of miRNAs were normalized against U6 small nuclear 
RNA internal control and values plotted as relative transcript abundance. Total 
RNA (500 ng) was reverse transcribed with Superscript III (Life Technologies) and 
anchored oligodT primer as per the manufacturer’s instructions. Transcript levels 
were measured by qRT-PCR using SYBR Green PCR Master Mix (Applied 
Biosystems, Life Technologies) using gene-specific primers listed in Supplementary 
Table 1. CT values were normalized to endogenous RPLP0 values, which have 
been specifically recommended for use in keratinocytes. For the detection of 
FSTL1 mRNA by semiquantitative RT-PCR, the PCR amplification cycles were 
limited to 28. All experiments were performed in three biological replicates and 
representative figures are shown. 
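
The normalization described above can be illustrated with a short sketch. It assumes the widely used 2^-ΔΔCT calculation with perfect amplification efficiency, which the text does not state explicitly; the function name and the CT values are hypothetical and serve only as an illustration.

```python
def relative_abundance(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Relative transcript abundance by the 2^-ddCT method.

    Assumption: amplification efficiency of 100% (doubling per cycle).
    ct_reference is the internal control (for example U6 for miRNA or
    RPLP0 for mRNA, as named in the text).
    """
    delta_ct_sample = ct_target - ct_reference              # normalize sample
    delta_ct_control = ct_target_ctrl - ct_reference_ctrl   # normalize control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


# Hypothetical CT values: target at 24 cycles in the treated sample and
# 26 cycles in the control, reference gene at 18 cycles in both.
fold_change = relative_abundance(24.0, 18.0, 26.0, 18.0)
print(f"relative transcript abundance: {fold_change:.2f}")
```

A lower CT after normalization corresponds to a more abundant transcript, so a ΔΔCT of -2 translates into a fourfold increase relative to the control.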
Microarray analysis. N/TERT-1 keratinocytes were transfected with 50 nM of 
negative control or miR-198 mimics using DharmaFECT transfection reagent 
according to the manufacturer’s protocol. Forty-eight hours after transfection, total 
RNA was isolated using the Exiqon miRCURY RNA Isolation Kit. Two-hundred-and-fifty 
nanograms of total RNA was converted into biotinylated antisense RNA 
(cRNA) using a TargetAmp Nano-g Biotin-aRNA Labelling Kit (Epicentre). 
Seven-hundred-and-fifty nanograms of biotinylated cRNA was hybridized to 
HT-12 v4 expression bead chip (Illumina) using samples in triplicate. Hybridization, 
washing and scanning were performed according to the manufacturer’s 
protocol. Data extracted were normalized and analysed using Illumina BeadStudio. 
Similar global expression profiling by microarray was performed on 
N/TERT-1 keratinocytes transfected with siRNA against human FSTL1 at 48 h 
after transfection. Microarray data have been deposited in the Gene Expression 
Omnibus database under accession numbers GSE37967 and GSE41615. 
In vitro scratch-wounding assays. N/TERT-1 keratinocytes were transfected 
with miRIDIAN miRNA mimics at 50% confluence (Dharmacon). As a control, 
a non-targeting sequence-encoding Caenorhabditis elegans miR-239b with no 
known human targets was used. Cells were transfected with 50 nM miRNA mimic 
using Dharmafect 1 according to the manufacturer’s protocol. Transfection effi- 
ciency was assessed on the basis of co-transfection of SiGLO Red transfection 
indicator. Using an IncuCyte wound maker, scratch wounds were made simulta- 
neously in all culture wells 48 h after transfection and scratch area was monitored 
over 24h using IncuCyte live-cell imaging system (Essen BioScience). Scratch- 
wound results were compiled from six wells with one scratch in each well. At the 
24-h time point, closure of the control scratch wound was observed. To knock 
down FSTL1, DIAPH1, PLAU and LAMC2, 50 nM of smart-pool siRNA against 
the open reading frame sequence (Dharmacon) was used to transfect keratinocytes. 
A non-targeting siRNA was transfected as a negative control. Transfection 
was performed using Dharmafect 1 transfection reagent. For rescue experiments 
using target gene overexpression, 100 ng of DIAPH1, LAMC2 and PLAU open 
reading frame or an empty vector construct (Origene) was co-transfected along 
with miR-198 mimic using FuGENE 6 transfection reagent. Scratch wounds were 
created 48 h after transfection and wound closure was measured by IncuCyte live-cell 
imaging system. 
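
One simple way to summarize scratch-wound data of this kind is the percentage of the initial scratch area that has closed at a given time point, averaged over the six wells. The sketch below is an assumption about how such a summary could be computed; it is not the IncuCyte software's own metric, and all names and area values are hypothetical.

```python
def percent_closure(initial_area, area_at_t):
    """Percentage of the initial scratch area that has closed by time t."""
    if initial_area <= 0:
        raise ValueError("initial scratch area must be positive")
    return 100.0 * (initial_area - area_at_t) / initial_area


def mean_closure(initial_areas, areas_at_t):
    """Average percent closure across wells (one scratch per well)."""
    closures = [percent_closure(a0, at) for a0, at in zip(initial_areas, areas_at_t)]
    return sum(closures) / len(closures)


# Hypothetical scratch areas (arbitrary units) for six wells at 0 h and 24 h.
initial = [100.0] * 6
control_24h = [0.0, 10.0, 0.0, 10.0, 0.0, 10.0]
mimic_24h = [60.0, 70.0, 60.0, 70.0, 60.0, 70.0]

print(f"control closure at 24 h: {mean_closure(initial, control_24h):.1f}%")
print(f"mimic closure at 24 h:   {mean_closure(initial, mimic_24h):.1f}%")
```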

Inhibition of miR-198/DROSHA and effect on target gene expression. 5’ 
fluorescein-labelled, phosphorothioate-modified power inhibitors against mature 
miR-198 (anti-miR-198), or a negative control inhibitor, were purchased from 
Exiqon. N/TERT-1 cells were transfected with 50 nM of the inhibitors using 
Dharmafect 1 transfection reagent. Forty-eight hours after transfection, target 
proteins were subjected to immunocytochemistry using respective antibodies. 
For the knockdown of DROSHA, N/TERT-1 cells were transfected with smart- 
pool siRNAs against human DROSHA and total RNA was isolated at 3 days after 
transfection. 

Proliferation assays. N/TERT-1 keratinocytes were monitored by image analysis 
for 48h after transfection. Cells were transfected at low density, and 24h after 
transfection imaging was carried out in subconfluent states using IncuCyte live-cell 
imaging system (Essen BioScience). Proliferation was calculated by IncuCyte soft- 
ware algorithm to output a proliferation index corresponding to change in conflu- 
ence of each well. These measurements are an average of six wells for each treatment. 
Immunohistochemistry. Five-micrometre tissue sections were mounted on poly- 
L-lysine-coated glass slides (Thermo Scientific). After de-paraffinization, endo- 
genous peroxidase was quenched by immersing the slides in 3% hydrogen 
peroxide in methanol. After incubation in 10% goat serum, slides were incubated 
with the primary antibody. Slides were subjected to thorough rinsing before 
incubation with species-matched secondary horseradish peroxidase (HRP)-labelled 
polymer antibodies (Dako). Chromogen 3,3'-diaminobenzidine (Dako) was used 
as substrate for colour development. Slides were counterstained with haematox- 
ylin before dehydration and mounting with DPX (Sigma). Images were acquired 
on a Zeiss Axioimager microscope. 

Immunocytochemistry. Keratinocytes cultured on coverslips were fixed using 
cold acetone methanol. Coverslips were submerged in 5% BSA in PBS to block 
non-specific adherence and incubated with primary antibodies overnight at 4 °C. 
Cells were rinsed in PBS before incubation with species-matched secondary 
antibodies conjugated to Alexa probes (Molecular Probes, Invitrogen). After counterstaining 
with DAPI (4’,6-diamidino-2-phenylindole; 100 ng ml⁻¹), cells were 
mounted using FluorSave mounting media and images were acquired on an 
Olympus FluoView FV1000. 

miRNA in situ hybridization. Five-micrometre tissue sections were incubated 
with LNA probes (5’-DIG-labelled LNA probes specific for miR-198 or scrambled 
probe with no homology to known vertebrate miRNAs (Exiqon)) in hybridization 
buffer (Roche) at 51 °C for 4 h. After stringent washes, sections were blocked with 
10% goat serum and further incubated with anti-DIG alkaline phosphatase 
(Roche) overnight at 4 °C. Sections were washed in PBS-T (0.1%) and miRNA- 
bound LNA probes were detected by Fast Red Substrate (Panomics). After coun- 
terstaining with DAPI, slides were mounted using FluorSave (Merck). Image 
acquisition was performed with Olympus FluoView FV1000 using TRITC filter. 
LNA probes designed for mature miRNAs do not recognize the precursor or 
primary miRNAs. 

Luciferase assay. For the generation of chimaeric constructs with the target 3’-UTR 
linked to a firefly reporter gene, the 3’-UTR fragment spanning the miR-198-binding 
site was amplified from the parental 3’-UTR clone (GeneCopoeia) using 
specific primers and subcloned into the multiple cloning site of pmirGLO Dual- 
Luciferase miRNA-target expression vector. 

The FSTL1 3'-UTR with (2.7-kb full-length 3’-UTR) and without pre-miR-198 
stem-loop (0.9-kb truncated 3'-UTR) were amplified and subcloned from the 
parental full-length construct (Origene). The chimaeric constructs were co- 
transfected with miR-198 or a non-targeting negative control mimic using 
Effectene (Qiagen) in 293T cells. Firefly and Renilla luciferase activities were 
measured 48h after transfection using a Dual-Luciferase reporter assay system 
(Promega). The firefly luminescence was normalized to Renilla luminescence 
values as an internal control for transfection efficiency. For the mutant 3’-UTRs 
in miR-198 targets, the miR-198 binding site, UCUGGAC, was converted to 
UUCAAGU using QuikChange Site-Directed Mutagenesis Kit (Stratagene) as 
per the manufacturer’s instructions. 
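
The dual-luciferase normalization described above can be sketched as follows: each firefly reading is divided by the Renilla reading from the same well, and the result is expressed relative to the mean of the control wells. The function names and luminescence readings are hypothetical and are given only to illustrate the arithmetic.

```python
def normalized_activity(firefly, renilla):
    """Firefly luminescence normalized to Renilla (transfection control)."""
    if renilla <= 0:
        raise ValueError("Renilla luminescence must be positive")
    return firefly / renilla


def relative_to_control(sample_wells, control_wells):
    """Express normalized activity of each sample well relative to the control mean.

    Each well is a (firefly, renilla) pair of raw luminescence readings.
    """
    control = [normalized_activity(f, r) for f, r in control_wells]
    control_mean = sum(control) / len(control)
    return [normalized_activity(f, r) / control_mean for f, r in sample_wells]


# Hypothetical raw readings, n = 3 wells per condition.
scrambled_control = [(1000.0, 100.0), (1200.0, 100.0), (800.0, 100.0)]
mir198_mimic = [(500.0, 100.0), (400.0, 100.0), (600.0, 100.0)]

relative = relative_to_control(mir198_mimic, scrambled_control)
print("relative luciferase activity:", [round(x, 2) for x in relative])
```

Values below 1 indicate repression of the reporter relative to the scrambled control; for a mutated binding site the expectation would be values close to 1.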

RNA-binding-protein immunoprecipitation. Keratinocytes, either treated with 
5 ng ml⁻¹ of TGF-β1 or mock treated, were lysed in radioimmunoprecipitation 
assay (RIPA) buffer. After clarifying the lysate at 13,000g for 10 min, the cell extract 
was incubated with 2 µg rabbit KSRP antibody (Bethyl Laboratories) or control 
rabbit IgG, overnight at 4 °C. The antigen-antibody complex was pulled down 
with 25 µl (resin volume) of protein A sepharose pre-equilibrated with RIPA 
buffer at 4 °C for 1 h. The immunoprecipitated RNA bound to the resin was 
extracted using the Exiqon miRCURY RNA Isolation Kit. Reverse transcription 
of the immunoprecipitated RNA as well as the input extract RNA (10%) was 
carried out with Superscript III (Invitrogen) reverse transcriptase. Four microlitres 
of the complementary DNA was used in qRT-PCR using primers spanning 
pre-miR-198 in the primary transcript. Results were normalized to input RNA 
levels and plotted as fold enrichment compared to the IgG control RIP. 

For RIP with skin samples, biopsies were snap frozen immediately in liquid 
nitrogen at 0 h or 24 h after injury. The epidermis was peeled off from the dermis 
after dispase treatment overnight and homogenized in RIPA buffer, clarified and 
processed for RIP as above. 
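
A common way to carry out the RIP normalization described above is the percent-input calculation followed by a ratio against the IgG control. The sketch below assumes this approach and a doubling of product per PCR cycle; the text specifies only that 10% input was used, and the function names and CT values are hypothetical.

```python
import math


def percent_input(ct_ip, ct_input, input_fraction=0.10):
    """Signal in the immunoprecipitate as a percentage of total input RNA.

    The input CT is first adjusted to represent 100% of the material,
    because only a fraction (here 10%) of the extract was used as input.
    """
    adjusted_input_ct = ct_input - math.log2(1.0 / input_fraction)
    return 100.0 * 2.0 ** (adjusted_input_ct - ct_ip)


def fold_enrichment(ct_ip, ct_igg, ct_input, input_fraction=0.10):
    """Enrichment of the specific RIP over the IgG control RIP."""
    specific = percent_input(ct_ip, ct_input, input_fraction)
    background = percent_input(ct_igg, ct_input, input_fraction)
    return specific / background


# Hypothetical CT values for one transcript.
enrichment = fold_enrichment(ct_ip=25.0, ct_igg=28.0, ct_input=22.0)
print(f"fold enrichment over IgG: {enrichment:.1f}")
```

When the specific and IgG pull-downs share the same input, the input term cancels and the enrichment reduces to 2 raised to the CT difference between the IgG and the specific RIP.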
In vitro precursor miRNA processing assay. For preparation of pre-miR-198, 
pBluescript KS constructs containing pre-miR-198 sequence (obtained from 
miRBase) under the T7 RNA polymerase promoter were linearized and transcribed 
in the presence of radioactive [³²P]-UTP. Transcripts were resolved 
through 8% polyacrylamide gel, detected by autoradiography and gel purified in 
0.3 M NaCl, overnight at 4 °C. The purified pre-miR-198 was denatured at 90 °C 
for 2 min and re-folded by slow cooling to room temperature (22 °C) in a buffer 
containing 25 mM Tris-Cl, pH 7.4, 100 mM NaCl, 5 mM MgCl₂ and 10% glycerol. 
The re-folded pre-miR-198 was stored in aliquots at −20 °C until further use. For 
the preparation of cytoplasmic extracts as the source of Dicer enzyme, subconfluent 
293T cells were lysed in buffer containing 25 mM Tris-Cl, pH 7.4, 150 mM 
NaCl, 5 mM MgCl₂, 0.25% NP-40 and protease inhibitor cocktail (Roche) for 
10 min on ice. After clarifying the extract at 10,000g for 10 min, the protein 
concentration was measured using the BCA assay kit (Pierce Biotechnology). 

In vitro precursor-miRNA processing assay was carried out with 50 µg of 293T 
extract and pre-miR-198 in the presence or absence of recombinant KSRP protein 
(Origene) for 60 min at room temperature. Samples were processed with phenol:chloroform 
extraction and resolved through 8% denaturing polyacrylamide gel 
electrophoresis. To track the position of mature miR-198, the pre-miR-198 was 
treated with ShortCut RNase III (New England Biolabs) and processed as above. 
Low-range ssRNA ladder (New England Biolabs) and microRNA marker (New 
England Biolabs) were used to confirm the size of the reaction product. Reaction 
products were exposed to storage phosphor screens (GE Healthcare) and detected 
using Storm phosphorImager (GE Healthcare). 

Gel retardation assay. The pre-miR-198 substrate for gel retardation assay 
was prepared as described above. Pre-miR-198 transcripts with mutated terminal 
loop and stem sequences (Supplementary Fig. 7b) were generated by mutating 
the parental plasmid (pBSKS) constructs using QuikChange Site-Directed 
Mutagenesis Kit (Stratagene). Recombinant KSRP protein (0.5 µM) was incubated 
with the wild-type or mutant pre-miR-198 transcripts in 20 µl of reaction buffer 
containing 30 mM Tris-HCl, pH 7.4, 5 mM MgCl₂, 50 mM KCl, 0.5 mM DTT, 
40 U ml⁻¹ RNaseOUT, 250 µg ml⁻¹ of yeast transfer RNA and 10% glycerol. After 
30 min at room temperature, reaction products were resolved through 6% native 
polyacrylamide gel electrophoresis. The gel was dried and the protein-RNA com- 
plex and unbound RNA were visualized by phosphorimaging. 

Statistical analysis. Values are reported as the mean ± the standard error. 
Statistical significance between two samples was determined with two-tailed 
Student’s t-test using GraphPad InStat 3.0 software (GraphPad Software). 
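
For readers without access to the software named above, the same kind of comparison can be sketched with the Python standard library. This is an illustrative stand-in, not the authors' analysis: it assumes the classical pooled-variance (equal variance) form of Student's t-test, obtains the two-tailed P value by numerical integration of the t density, and uses hypothetical measurements.

```python
import math
import statistics


def sem(values):
    """Standard error of the mean (sample standard deviation / sqrt(n))."""
    return statistics.stdev(values) / math.sqrt(len(values))


def t_pdf(x, df):
    """Probability density of Student's t distribution with df degrees of freedom."""
    coeff = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t_stat, df, steps=2000):
    """Two-tailed P value via Simpson's rule on [0, |t|] (steps must be even)."""
    upper = abs(t_stat)
    if upper == 0:
        return 1.0
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = total * h / 3  # probability mass between 0 and |t|
    return max(0.0, 1.0 - 2.0 * central_area)


def student_t_test(a, b):
    """Two-sample Student's t-test with pooled variance; returns (t, df, p)."""
    n1, n2 = len(a), len(b)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * statistics.variance(a)
                  + (n2 - 1) * statistics.variance(b)) / df
    t_stat = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(
        pooled_var * (1 / n1 + 1 / n2))
    return t_stat, df, two_tailed_p(t_stat, df)


# Hypothetical measurements, n = 3 per group.
control = [1.0, 2.0, 3.0]
treated = [4.0, 5.0, 6.0]
t_stat, df, p_value = student_t_test(control, treated)
print(f"control: {statistics.mean(control):.2f} ± {sem(control):.2f} (s.e.m.)")
print(f"t = {t_stat:.3f}, df = {df}, two-tailed P = {p_value:.4f}")
```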
MicroRNA-34a regulates cardiac ageing and function 

Ageing is the predominant risk factor for cardiovascular diseases’ 
and contributes to a significantly worse outcome in patients with 
acute myocardial infarction’. MicroRNAs (miRNAs) have emerged 
as crucial regulators of cardiovascular function and some miRNAs 
have key roles in ageing**. We propose that altered expression 
of miRNAs in the heart during ageing contributes to the age- 
dependent decline in cardiac function. Here we show that miR-34a 
is induced in the ageing heart and that in vivo silencing or gene- 
tic deletion of miR-34a reduces age-associated cardiomyocyte cell 
death. Moreover, miR-34a inhibition reduces cell death and fibrosis 
following acute myocardial infarction and improves recovery of 
myocardial function. Mechanistically, we identified PNUTS (also 
known as PPP1R10) as a novel direct miR-34a target, which reduces 
telomere shortening, DNA damage responses and cardiomyocyte 
apoptosis, and improves functional recovery after acute myocardial 
infarction. Together, these results identify age-induced expression of 
miR-34a and inhibition of its target PNUTS as a key mechanism that 
regulates cardiac contractile function during ageing and after acute 
myocardial infarction, by inducing DNA damage responses and 
telomere attrition. 


To study the ageing process in the heart, we compared aged mice 
(18-20 months old) with young mice (6-8 weeks old). As expected, 
hearts of aged mice showed increased fibrosis (Fig. 1a), hypertrophy 
(Supplementary Fig. 2a), shorter telomeres (Fig. 1b) and increased 
cardiomyocyte apoptosis (Fig. 1c). Expression profiling of miRNAs 
isolated from the hearts of young and aged mice identified a number 
of dysregulated miRNAs that are known to have a role in cardiac 
(patho)physiology and in senescence (Fig. 1d and Supplementary 
Tables 1 and 2). For example, miR-21 and miR-574-5p are induced 
in the failing heart*®, and miR-146a (ref. 7) and miR-29 (ref. 8) increase 
during senescence, whereas other previously described age-associated 
miRNAs were not regulated’. Interestingly, the entire miR-34 family 
(comprising miR-34a, b and c) (Fig. 1e) was significantly upregulated 
in aged hearts, whereas several cell-type enriched miRNAs were 
unchanged (Supplementary Fig. 2b). Furthermore, miR-34a levels, 
but not levels of the muscle-enriched miR-1 and miR-133, also signifi- 
cantly correlated with age in human heart biopsies (n = 21), whereas 
levels of cardiac-enriched miR-208 declined (Fig. 1f, Supplementary 
Fig. 2c). As miR-34 is involved in apoptosis and senescence”'®"’, we 
focused on this miRNA family. 
Figure 1 | Ageing induces cardiac miR-34 expression. a, Fibrosis staining of 
hearts from young (6-8 weeks old) and aged (18-20 months old) C57/Bl6 mice. 
b, c, Real-time PCR-based telomere length measurement of young and aged 
hearts (b) and cell death quantification using terminal deoxynucleotidyl 
transferase dUTP nick end labelling (TUNEL) (c). d, Microarray profiling 
results of miRNAs in aged and young hearts. e, Schematic representation of the 
primary transcripts encoding miR-34a, b, c, chromosomal location and 
sequence homologies in mouse. f, RNA was isolated from human atrium 
biopsies and miR-34a expression was measured by real-time PCR. g, Real-time 
PCR expression analysis of miR-34a in aged and young mice hearts. 
h, Quantitative real-time PCR for miR-34a, b and c on RNA from total young 
mouse heart. i, Young heart cell-types were separated using Langendorff-perfusion 
digestion and miR-34a levels were measured by real-time PCR. n = 3 
for all experiments, *P < 0.05. Error bars depict s.e.m. 
Quantitative real-time polymerase chain reaction (PCR) confirmed 
the upregulation of miR-34a in aged mice (Fig. 1g) and showed that 
miR-34a is the predominantly expressed miR-34 family member in the 
heart (Fig. 1h). Moreover, miR-34a levels were higher in isolated cardiomyocytes 
compared to non-cardiomyocytes (Fig. 1i). These data 
were further corroborated by miR-34a in situ hybridization in mouse 
heart sections (Supplementary Fig. 3). 

As miR-34a has been shown to induce apoptosis in cancer cell lines" 
and apoptosis is a fundamental feature of ageing cardiomyocytes 
(Fig. 1c), we next assessed whether miR-34a has a role in cardiomyo- 
cyte apoptosis. Inhibition of miR-34a in rat neonatal cardiomyocytes 
reduced H₂O₂-induced apoptosis, whereas pre-miR-34a overexpression 
augmented this apoptosis (Supplementary Fig. 4). To study the 
role of miR-34a in apoptosis in vivo, we inhibited miR-34a by antisense 
oligonucleotides (antagomirs; Ant-34a)'*, which reduced cardiac 
expression of miR-34a (Supplementary Fig. 5). To investigate whether 
miR-34a contributes to age-associated cell death in the heart, we treated 
18-month-old mice with 8 mg kg⁻¹ Ant-34a, collected the hearts after 
1 week and histologically quantified dead cells. Ant-34a treatment 
reduced the number of dead cells compared to controls (Fig. 2a). To 
further establish a role for miR-34a in cardiac ageing, we analysed 
cardiac contractile function by echocardiography in aged (73-83 weeks 
old) miR-34a knockout (miR-34a−/−) mice (Fig. 2b). Whereas cardiac 
function in young miR-34a−/− mice was comparable to wild-type 
littermates (Supplementary Fig. 6), the age-associated decline in 
cardiac function observed in the wild-type mice was prevented in 
miR-34a−/− mice (Fig. 2b). Furthermore, aged miR-34a−/− mice 
hearts were less hypertrophic and showed less cell death, compared 
to wild-type controls (Fig. 2c and Supplementary Fig. 6). To further 
substantiate a role of miR-34a in cardiac ageing phenotypes, we analysed 
Ku80−/− mice (Ku80 is also known as Xrcc5), a genetic mouse 
model of accelerated ageing. In Ku80−/− mice cardiac miR-34a levels 
increased and pharmacological inhibition of miR-34a by locked 
nucleic acid (LNA)-based antimiRs abolished deterioration of cardiac 
contractile function (Fig. 2d and Supplementary Fig. 7). 

Because apoptosis and fibrosis are also hallmarks after an acute 
myocardial infarction (AMI) and age aggravates cardiac dysfunction 
after AMI, we next assessed the role of miR-34a in a mouse model of 
AMI. miR-34a expression was significantly upregulated in the border 
zone of the myocardial infarction (Fig. 2e). To test whether miR-34a 
contributes to cardiac dysfunction after AMI, we induced AMI in mice 
and injected antagomirs or LNA-based antimiRs. Whereas the extent 
of myocardial infarction was similar at day 0, cardiac contractile func- 
tion at two weeks after myocardial infarction was significantly better 
in antagomir Ant-34a or LNA-based antimiRs treated mice compared 
to controls, as measured by wall motion score index and ejection 
fraction (Fig. 2f and Supplementary Fig. 8). These findings were con- 
firmed by analysis of cell death and fibrosis on histological sections of 
the hearts (Fig. 2g-i). The reduction in the number of dead cells was not 
confined to cardiomyocytes, but also included non-cardiomyocytes 
(Supplementary Fig. 9). miR-34a was previously shown to induce 
senescence in endothelial cells'? and to inhibit pro-angiogenic cell 
functions'*!’. Therefore, we also investigated a potential impact of 
miR-34a inhibition on neovascularization. Indeed, Ant-34a treatment 
increased capillary density in the border zone of the infarct in vivo and 
miR-34a modulated angiogenic sprouting of human umbilical vein 
endothelial cells (HUVECs) in vitro (Supplementary Fig. 10). These 
findings indicate that miR-34a inhibition does not only improve 
cardiomyocyte survival but also has favourable effects on other cell 
types present in the heart, for example, endothelial cells. 

To identify putative target messenger RNAs of miR-34a, we used 
three microRNA target prediction tools, miRanda, PicTar and Tar- 
getscan 5.1 (Supplementary Fig. 11a). The 49 targets, that were pre- 
dicted by all three algorithms (Supplementary Table 3), were then 
compared to genes that were downregulated in aged mice (< 1.5-fold), 
as assessed in genome-wide mRNA expression profiles of aged and 
Figure 2 | miR-34a reduction inhibits age-related and myocardial 
infarction-induced cardiomyocyte cell death and cardiac function. 
a, Cardiomyocyte cell death in aged mice (18 months old) treated with 
antagomir-34a or scrambled control antagomir. b, Echocardiography of 
miR-34a−/− mice and miR-34a+/+ littermates at 73, 77, 79 and 83 weeks of age. 
Representative m-mode echocardiography images on the right. c, Cell death in 
hearts of miR-34a−/− and wild-type littermates. d, Echocardiography in 
Ku80−/− mice at 6, 8, and 10 weeks of age that received scrambled control 
locked nucleic acid modified antimiRs (LNA-Co) and LNA-34a weekly 
subcutaneous from 4 weeks of age. Representative m-mode echocardiography 
images on the right. e, RNA was isolated from the border zone of the 
myocardial infarction at 3, 5 and 7 days after AMI (black bars) or from sham- 
operated animals (white bar). miR-34a levels were measured by real-time PCR. 
f, Echocardiography of mice subjected to AMI (day 0). After AMI, mice 
received control antagomir or antagomir-34a. Echocardiographic data are 
presented as wall motion score index and ejection fraction. g, h, Histological 
sections of infarcted hearts four weeks after AMI were analysed for TUNEL- 
positive cells (g) and scar area (h). i, Representative scar area and TUNEL 
stainings. White arrows indicate TUNEL-positive cells. n = 4 per group for all 
experiments, *P < 0.05. Error bars depict s.e.m. 


young mice (Supplementary Table 4). Only one gene was both a pre- 
dicted target of miR-34a and downregulated by age on the mRNA 
level, Ppp1r10 (also known as Pnuts; Supplementary Fig. 11b). In 
line with the post-transcriptional mechanism of action of miRNAs, 
PNUTS was even more profoundly downregulated at the protein level 
in aged hearts (Fig. 3a). PNUTS has a role in apoptosis of cancer cell 
lines’®, interacts with the telomere regulator TRF2”, and is involved in 
DNA repair"’, indicating a potential role in ageing. To assess whether 
PNUTS is a direct target of miR-34a, we generated luciferase reporter 
constructs, in which the miR-34a seed target or the PNUTS 3’ untrans- 
lated region (3’UTR) is placed behind the luciferase gene. miR-34a 
inhibited luciferase activity, whereas no effect was observed when the 
Figure 3 | miR-34a directly targets PNUTS. a, Western blot for PNUTS
protein in hearts of aged (18-20 months) and young (6-8 weeks) mice.
b, Luciferase assay with wild-type PNUTS 3’UTR or PNUTS 3’UTR mutated in
the predicted binding site of miR-34a, transfected with pre-miR-34a or control
RNA. c, Mice were injected with antagomir-34a or control antagomir. Hearts
were collected after 6 weeks and PNUTS protein levels were measured by
western blot. d, PNUTS levels were measured by western blot in hearts of
19-21-week-old miR-34a+/+, miR-34a+/− and miR-34a−/− mice. n = 3 for all
experiments, *P < 0.05. Error bars depict s.e.m.


miR-34a target site was mutated (Fig. 3b and Supplementary Fig. 11). 
Conversely, inhibition of miR-34a induced luciferase activity (Sup- 
plementary Fig. 11c). A reverse-transcriptase-based assay confirmed 
that miR-34a also binds to the PNUTS 3’UTR (Supplementary 
Fig. 11e). In vivo, Ant-34a-mediated inhibition of miR-34a upregulated
PNUTS in the heart (Fig. 3c) and genetic deletion of miR-34a also
dose-dependently increased cardiac PNUTS levels (Fig. 3d). Together, 
these results demonstrate that miR-34a directly targets PNUTS. 
We then determined whether PNUTS inhibits apoptosis of cardiomyocytes.
In PNUTS-overexpressing cardiomyocytes, H2O2-induced
apoptosis was significantly lower and the miR-34a-mediated enhancement
of apoptosis was completely abrogated (Fig. 4a). To assess whether
PNUTS can also prevent cardiac contractile impairment in vivo, we 
generated adeno-associated virus (AAV9) to overexpress PNUTS 
under control of a cytomegalovirus-enhanced myosin light chain car- 
diac promoter (Fig. 4b and Supplementary Fig. 12). Cardiac PNUTS 
overexpression prevented the deterioration of cardiac contractile 
function after AMI in mice (Fig. 4b). PNUTS is known to interact 
with TRF2, and together these proteins have been described to play 
a role in DNA damage responses (DDR)'*”. DNA damage responses 
can be activated by bona fide DNA damage or by telomere dysfunc- 
tion, when telomeres are aberrantly recognized as DNA damage, cul- 
minating in phosphorylation of CHK2 and apoptosis. Both telomere 
attrition and defects in DNA repair can cause cardiac dysfunction 
and cardiomyocyte apoptosis*””**. To assess whether inhibition of 
DNA damage responses could be a possible mechanism for the anti- 
apoptotic effects of PNUTS, we analysed phosphorylation of CHK2. 
CHK2 phosphorylation was diminished after ectopic expression of 
PNUTS in cardiomyocytes (Supplementary Fig. 13). Downregu- 
lation of TRF2 using specific small interfering RNAs showed that 
Figure 4 | PNUTS inhibits apoptosis, telomere attrition and DNA damage
response in cardiomyocytes. a, Cardiomyocytes were transduced with mock
or PNUTS overexpression lentivirus and, after 3 days, were transfected with
pre-miR-34a or control RNA. Apoptosis was stimulated with 100 µM H2O2 on
day 4 and quantified by flow cytometry on day 5. Histograms on the right
represent total annexin V signal. b, Mice received adeno-associated virus
(AAV9) to overexpress human PNUTS (AAV-PNUTS) or firefly luciferase
(AAV-Luc) under the control of a cardiac-specific promoter 12-14 days before
AMI. Heart function was assessed by echocardiography directly after AMI and
after 7 and 14 days. PNUTS protein levels were assessed in hearts at 15 days
after AMI. c, Cardiomyocytes were transduced with mock or PNUTS
overexpression lentivirus and treated with doxorubicin (0.1 µM) or
4-nitroquinoline 1-oxide (1 µM) for 24 h or left untreated. γ-H2A.X and
α-tubulin were detected by western blot. d, Telomere length was assessed in
human cardiomyocytes by quantitative fluorescent in situ hybridization at
6 days after lentiviral transduction with mock, PNUTS or mutant PNUTS
virus lacking TRF2-binding capacity (PNUTS-AA). Red line depicts average
telomere length per group. n = 3 for all experiments, *P < 0.05. Error bars
depict s.e.m.


inhibition of CHK2 activation by PNUTS does not occur in the 
absence of TRF2 (Supplementary Fig. 13). As DNA damage responses 
can be caused by DNA damage as well as telomere dysfunction, we 
assessed the DNA damage marker γ-H2A.X and telomere length in
cardiomyocytes. PNUTS reduced DNA damage in cardiomyocytes, as
measured by γ-H2A.X levels (Fig. 4c) and, consistently, inhibition of
miR-34a also reduced cardiac DNA damage after AMI (Supplemen- 
tary Fig. 14). Furthermore, PNUTS overexpression, but not TRF2- 
binding-deficient PNUTS, reduced telomere attrition in vitro, without 
affecting telomerase activity (Fig. 4d and Supplementary Fig. 15). 
Together these data show that PNUTS inhibits DNA damage response 
signalling and prevents AMI-induced cardiac contractile dysfunction, 
whereas deciphering its function during ageing deserves further studies. 

In conclusion, our data demonstrate that inhibition of miR-34a 
reduces age-induced cardiac cell death and functional decline in vivo. 
Moreover, miR-34a is induced after acute myocardial infarction and 
inhibition of miR-34a enhances cardiac contractile recovery after acute 
myocardial infarction. miR-34a suppresses the expression of PNUTS, 
which inhibits telomere erosion, DNA damage response and apopto- 
sis, thus providing a rationale for the pro-apoptotic effects of miR-34a 
in cardiomyocytes. In addition to PNUTS, miR-34a has other targets, 
such as SIRT1 (ref. 23), which confers protection against ischaemia
reperfusion injury in the heart (Supplementary Fig. 16)”. 
Nevertheless, we demonstrate that the miR-34a-PNUTS axis has 
profound effects on cardiac apoptosis, indicating that PNUTS is a key
downstream effector of miR-34a. Moreover, our findings that PNUTS 
is reduced in ageing and regulates telomere maintenance and DNA 
damage responses provide a plausible mechanism by which telomeres
erode during ageing. Other targets of miR-34a” and functions in non- 
cardiomyocytes’ could likewise contribute to its detrimental role in 
cardiac ageing and dysfunction. Finally, our results identify inhibition 
of miR-34a as a potential therapeutic strategy to improve cardiac 
contractile function after acute myocardial infarction. 


METHODS SUMMARY 

Laboratory animals, ethics and acute myocardial infarction model. Male 
C57BL/6 mice were obtained from Charles River and Janvier. The animal experi- 
ments were approved by the Regional Board of the State of Hessen, Germany. 
Patient material was obtained under informed consent and according to the 
declaration of Helsinki. Acute myocardial infarction operations were performed 
as described”®. 

RNA isolation, real-time PCR and transcriptional profiling. Total RNA was
isolated with miRNeasy kits from Qiagen, and real-time PCR was performed with
Applied Biosystems microRNA assays run on an Applied Biosystems StepOnePlus.
Quantification of real-time PCR was performed using synthetic microRNA 
sequences in a calibration curve. Transcriptional profiling was performed by 
DNAvision on the Agilent-v1 platform (570 miRNAs) or on the Affymetrix 430 
v2.0 platform (mRNA, full genome). 
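
The calibration-curve quantification mentioned above can be sketched as follows. This is a minimal illustration only, assuming a log-linear relationship between threshold cycle (Ct) and input copy number; the dilution series, the Ct values and all function names are hypothetical and are not data or code from this study.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate copy number for a sample Ct."""
    return 10 ** ((ct - intercept) / slope)


def amplification_efficiency(slope):
    """Efficiency implied by the slope (1.0 corresponds to perfect doubling)."""
    return 10 ** (-1.0 / slope) - 1.0


# Hypothetical dilution series of a synthetic microRNA (copies per reaction)
standard_copies = [1e3, 1e4, 1e5, 1e6, 1e7]
standard_ct = [30.0, 26.7, 23.4, 20.1, 16.8]

slope, intercept = fit_standard_curve(standard_copies, standard_ct)
unknown_copies = copies_from_ct(25.0, slope, intercept)
efficiency = amplification_efficiency(slope)
print(f"slope={slope:.2f}, intercept={intercept:.2f}, "
      f"copies={unknown_copies:.0f}, efficiency={efficiency:.2f}")
```

Fitting on the log10 scale keeps the curve linear over several orders of magnitude, so a sample Ct can be converted to an absolute copy number rather than a relative fold change.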

Statistical analysis. Data were analysed with GraphPad Prism 5 using Student’s
t-tests when comparing two conditions, or with an analysis of variance (ANOVA)
with Bonferroni or Newman-Keuls correction for multiple comparisons.
Probability values of less than 0.05 were considered significant and tests were
performed two-sided. Data were tested for outliers using the Grubbs outlier test
and, if outliers were present, they were removed. Data are presented as mean.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Laboratory animals, ethics and acute myocardial infarction model. C57BL/6
mice were obtained from Charles River and Janvier. Sirt1+/− mice?” and Ku80−/−
mice were described previously”. miR-34a−/− mice (SV/C57Bl6 background)
were generated using homologous recombination with a vector containing
miR-34a seed flanked with loxP sites and an intronic neomycin resistance (Neo)
cassette flanked by frt sites. The Neo cassette was removed by crossing to flp-mice
and miR-34a by crossing with CMV-Cre-mice. As a control, mice homozygous for
miR-34a flanked with loxP sites were used. Generation of miR-34a−/− mice will be
described in detail elsewhere (Hermeking et al., in preparation). miR-34a−/− mice
showed no obvious phenotype and were born at normal Mendelian ratio. The
animal experiments were approved by the Regional Board of the State of Hessen,
Germany. Patient material was obtained under informed consent and according to
the declaration of Helsinki. Acute myocardial infarction operations were
performed as described’® and the mice were subsequently evaluated to assess the
infarct size, after which mice received treatment. Echocardiography was
performed using a Vevo-770 (Visualsonics) by a researcher blinded to the
experimental conditions. Antagomirs (8 mg kg⁻¹) were synthesized by VBC Biotech
as described previously’®. The following sequences were used in this study:
Ant-Control: 5’-AAGGCAAGCUGACCCUGAAGUU-3’ and Ant-34a:
5'-ACAACCAGCUAAGACACUGCCA-3’. LNA-Anti-miRs (LNA/DNA mixmers)
(20 mg kg⁻¹ in Ku80−/− mice and 5 mg kg⁻¹ in wild-type mice) were from
Exiqon and the sequences were LNA-Control: 5’-ACGTCTATACGCCCA-3’,
Tm 75 °C; and LNA-34a: 5'-AGCTAAGACACTGCC-3’, Tm 74 °C.
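
As a consistency check on the inhibitor sequences listed above, the following sketch computes the reverse complement of each oligonucleotide and tests whether it occurs in the pre-miR-34a hairpin sequence given in the cell-culture section of these Methods. It is an illustration only: the helper name and the Watson-Crick-only pairing model are assumptions, and chemical modifications (LNA positions, antagomir backbone) are ignored.

```python
# Watson-Crick complement table for RNA
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq):
    """Return the RNA strand that would pair with seq (DNA T is read as U)."""
    rna = seq.upper().replace("T", "U")
    return "".join(COMPLEMENT[base] for base in reversed(rna))


# Sequences as listed in the Methods (5' to 3')
ANT_34A = "ACAACCAGCUAAGACACUGCCA"
ANT_CONTROL = "AAGGCAAGCUGACCCUGAAGUU"
LNA_34A = "AGCTAAGACACTGCC"
PRE_MIR_34A = (
    "GGCCAGCUGUGAGUGUUUCUUUGGCAGUGUC"
    "UUAGCUGGUUGUUGUGAGCAAUAGUAAGGAAGCAAUCAGCAAGUAU"
    "ACUGCCCUAGAAGUGCUGCACGUUGUGGGGCCC"
)

ant_target = reverse_complement_rna(ANT_34A)
lna_target = reverse_complement_rna(LNA_34A)
control_target = reverse_complement_rna(ANT_CONTROL)

print("Ant-34a pairs with:", ant_target)
print("  found in pre-miR-34a:", ant_target in PRE_MIR_34A)
print("LNA-34a pairs with:", lna_target)
print("  contained in the Ant-34a target:", lna_target in ant_target)
print("Ant-Control target in pre-miR-34a:", control_target in PRE_MIR_34A)
```

Under these assumptions, the 22-nucleotide Ant-34a and the shorter LNA-34a are complementary to the same stretch of the hairpin, whereas the control antagomir has no perfect match in it.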

Cell culture, transfection and flow cytometry. Rat neonatal cardiomyocyte
isolation was performed as described”. Human (GATA-4+/α-sarcomeric actin+)
cardiomyocytes were purchased from Promocell. Separation of mouse cardiomyocytes
and non-cardiomyocytes using Langendorff-perfusion-mediated digestion
was performed as described’®. Cardiomyocytes were transfected with pre-miR
constructs (pre-miR-34a: 5’-GGCCAGCUGUGAGUGUUUCUUUGGCAGUGUC
UUAGCUGGUUGUUGUGAGCAAUAGUAAGGAAGCAAUCAGCAAGUAU
ACUGCCCUAGAAGUGCUGCACGUUGUGGGGCCC-3’; pre-miR-control:
Pre-miR Negative Control #1 from Life Technologies) and anti-miR hairpin
inhibitors (miRIDIAN microRNA hsa-mir-34a hairpin inhibitor and miRIDIAN
microRNA Hairpin Inhibitor Negative Control #1 from Thermo Fisher), both
at 30 nM, as well as with siRNA (a mix of two sequences targeting Trf2,
5'-CCCAAAGUACCCAAAGGCA-3’ and 5’-GCUUCAGCGCCACCAUCCA-3’, 20 nM
each, or one negative control siRNA, 5'-CGUACGCGGAAUACUUCGA-3’,
40 nM), using HiPerfect reagent (Qiagen). Apoptosis
was quantified using annexin V and 7-AAD staining and flow cytometry analysis
on a Canto II (BD Biosciences). Annexin V-positive/7-AAD-negative cell
populations were considered apoptotic.

RNA isolation, real-time PCR and transcriptional profiling. Total RNA isola- 
tion was performed with miRNeasy kits from Qiagen and real-time PCR with 
Applied Biosystems microRNA assays run on an Applied Biosystems Step- 
OnePlus. miRNA levels were normalized with Rnu6 and Sno202 expression and 
mRNA levels with Rplp0. Quantification of real-time PCR was performed using 
synthetic microRNA sequences in a calibration curve. Transcriptional profiling 
was performed by DNAvision on the Agilent-v1 platform (570 miRNAs) or on the 
Affymetrix 430 v2.0 platform (mRNA, full genome). Data were
background-corrected and normalized by quantile normalization. Statistical comparisons were
done using a Bayesian one-way ANOVA and probability values were corrected for 
multiple testing with the Benjamini-Hochberg false discovery rate method”*. Data 
are available at the National Center for Biotechnology Information Gene 
Expression Omnibus (GEO) and are accessible through GEO Series accession 
number GSE43556. 
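
The two generic steps named above, quantile normalization and Benjamini-Hochberg correction, can be illustrated with a short sketch. It is not the pipeline used for the deposited data: the Bayesian ANOVA is not reproduced, ties are handled naively, and the toy intensities and P values are hypothetical.

```python
def quantile_normalize(samples):
    """Give every sample (list of intensities) the same empirical distribution.

    samples: list of equal-length lists, one list per array.
    Each value is replaced by the mean, across arrays, of the values
    sharing its rank. Ties are broken by position (a simplification).
    """
    n = len(samples[0])
    sorted_samples = [sorted(s) for s in samples]
    rank_means = [
        sum(s[rank] for s in sorted_samples) / len(samples) for rank in range(n)
    ]
    normalized = []
    for s in samples:
        order = sorted(range(n), key=lambda i: s[i])
        out = [0.0] * n
        for rank, idx in enumerate(order):
            out[idx] = rank_means[rank]
        normalized.append(out)
    return normalized


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical toy data: two arrays with three probes each
arrays = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]
normalized_arrays = quantile_normalize(arrays)

# Hypothetical raw P values for four probes
raw_p = [0.01, 0.04, 0.03, 0.20]
adjusted_p = benjamini_hochberg(raw_p)
print(normalized_arrays)
print(adjusted_p)
```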

Microscopy, microRNA in situ hybridization, telomere assays and histology. 
Confocal microscopy and in situ hybridization were performed as described
before”, with the following modifications: miR-34a LNA-modified detection 
probe (5’ and 3’ DIG-labelled) was from Exiqon, hybridization temperature 
57 °C. Telomere length was measured using real-time PCR, as described before”', 
as well as by quantitative fluorescence in situ hybridization (Q-FISH), using the 
telomere PNA FISH Cy3 kit (Dako). Chromosome spreads were generated by
addition of Colcemid (0.1 µg ml⁻¹, 24 h) and, after hybridization, images were
captured with a Zeiss Axio Observer and ×63 objective. Telomere signals were
quantified with TFL-Telo in a blinded manner*’. Green fluorescent microspheres 
(Invitrogen) were used to normalize between slides. Telomerase activity was mea- 
sured with the TRAPeze ELISA telomerase kit (Millipore). TUNEL (cell death) 
and Sirius red (fibrosis) staining was performed as described’*. Fibrosis in 
infarcted hearts was quantified as described*’. 

Luciferase assays, lentivirus, AAV and western blotting. Luciferase constructs 
were made by inserting the full-length human PNUTS 3’UTR, obtained from 
Imagenes in the psiCheck2 vector (Promega), in which the putative miR-34a seed 
sequence binding site (5'-CACTGCC-3’) was mutated to 5'-CTGTCGC-3’. 
Smaller UTR fragments were inserted using the following oligos: 2x-miR-34a:
5'-aagtcacactacatgcactgcctttgaaagtcacactacatgcactgcctt-3’, 2x-miR-34a-mut:
5'-aagCTacacATcatgcTGtCGctttgaaagCTacacATcatgcTGtCGctt-3’, miR-34a-footprint:
5’-tgctgggcctgaaagtcacactacatgcactgcctttggeagtcagctcactccctgctcc-3’, mut-footprint:
5’-tgctgggcctgaaagCTacacATcatgcTGtCGctttgggagtcagctcactccctgctcc-3’.
Mutations are highlighted as uppercase. psiCheck2 plasmids were transfected in 
HEK293T cells using Genejuice (Merck) and pre-miR-34a and control RNA 
(Applied Biosystems), as well as LNA-Control and LNA-34a were transfected 
using Ribojuice (Merck). PNUTS cDNA was obtained in pDONR221 
(Imagenes) and shuttled to the pLenti4 vector (Invitrogen). Lentiviral particles 
were generated as described™. Western blotting was carried out as described”®. For 
adeno-associated virus vector generation, PNUTS complementary DNA flanked 
by BamHI and XbaI restriction sites was inserted into pUF-CMV/MLC0.26, a
derivate of pUF-CMV/MLC0.26-uDys*. An AAV vector genome containing 
Renilla luciferase under control of the CMV/MLC0.26 promoter was used as 
control. AAV9 vectors were generated and titrated as described before****. Mice 
received 7 × 10¹¹ vector genomes. Anti-PNUTS antibodies were from R&D
Systems, anti-γ-H2A.X, anti-P-CHK2 and anti-CHK2 antibodies were from Cell
Signaling. 

Statistical analysis. Data were analysed with GraphPad Prism 5 using Student’s
t-tests when comparing two conditions, or an ANOVA with Bonferroni or
Newman-Keuls correction for multiple comparisons. Probability values of less
than 0.05 were considered significant and tests were performed two-sided. Data
were tested for outliers using the Grubbs outlier test and, if outliers were present,
they were removed. Data are presented as mean and error bars depict the standard
error of the mean (s.e.m.).
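
For orientation, the summary statistics named above can be written out as a short sketch. It is not the GraphPad Prism procedure itself. The function names and the example values are hypothetical, and the Grubbs statistic is computed without the critical value, which would have to be taken from a t-distribution table for the chosen significance level.

```python
import math
import statistics


def mean_and_sem(values):
    """Mean and standard error of the mean (sample s.d. divided by sqrt(n))."""
    n = len(values)
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def bonferroni(p_values):
    """Bonferroni-adjusted P values: multiply by the number of comparisons."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def grubbs_statistic(values):
    """G = largest absolute deviation from the mean, in units of sample s.d."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return max(abs(v - mean) for v in values) / sd


# Hypothetical measurements from one group (n = 4)
group = [1.0, 2.0, 3.0, 4.0]
group_mean, group_sem = mean_and_sem(group)
g_value = grubbs_statistic(group)

# Hypothetical raw P values from three pairwise comparisons
adjusted = bonferroni([0.01, 0.02, 0.5])
print(group_mean, group_sem, g_value, adjusted)
```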
Non-optimal codon usage affects expression,
structure and function of clock protein FRQ 


Mian Zhou, Jinhu Guo, Joonseok Cha, Michael Chae, She Chen, Jose M. Barral, Matthew S. Sachs & Yi Liu


Codon-usage bias has been observed in almost all genomes and is 
thought to result from selection for efficient and accurate trans- 
lation of highly expressed genes’*. Codon usage is also implicated 
in the control of transcription, splicing and RNA structure*®. 
Many genes exhibit little codon-usage bias, which is thought to 
reflect a lack of selection for messenger RNA translation. Alter- 
natively, however, non-optimal codon usage may be of biological 
importance. The rhythmic expression and the proper function of 
the Neurospora FREQUENCY (FRQ) protein are essential for 
circadian clock function. Here we show that, unlike most genes 
in Neurospora, frq exhibits non-optimal codon usage across its 
entire open reading frame. Optimization of frq codon usage 
abolishes both overt and molecular circadian rhythms. Codon 
optimization not only increases FRQ levels but, unexpectedly, also 
results in conformational changes in FRQ protein, altered FRQ 
phosphorylation profile and stability, and impaired functions in 
the circadian feedback loops. These results indicate that non- 
optimal codon usage of frq is essential for its circadian clock func- 
tion. Our study provides an example of how non-optimal codon 
usage functions to regulate protein expression and to achieve 
optimal protein structure and function. 

Eukaryotic circadian oscillators consist of autoregulatory circadian 
negative-feedback loops. In the core circadian oscillator of Neurospora 
crassa, FRQ protein is a central component that functions as the cir- 
cadian negative element with its partner FRH’’. Two transcription 
factors, WHITE COLLAR (WC)-1 and WC-2, form a heterodimeric 
complex that activates frq transcription’. The FRQ-FRH complex 
inhibits WC complex activity by interacting with the WC proteins'"”. 
The level and stability of FRQ have a key role in setting period length, 
phase and clock sensitivity to environmental signals’*'*. In addition, 
FRQ promotes the expression of both WC proteins in an interlocked 
positive-feedback loop'®™*. 

The protein-coding genes of Neurospora exhibit strong codon bias 
(Supplementary Fig. 1a). The third position of almost every codon 
family in this filamentous fungus has the preference C>G>T>A. 
Codon optimization enhances the expression of a heterologous luci- 
ferase gene in Neurospora’*’*. To establish that codon-usage bias reg- 
ulates protein expression, we identified the most abundant Neurospora 
proteins in a whole-cell extract by mass spectrometry analyses. The 
genes encoding the top 100 most abundant proteins (Supplementary 
Table 1) exhibit much stronger codon bias than the rest of the protein- 
coding genes (Supplementary Fig. 1b). 
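
A third-position preference such as C>G>T>A can be tallied directly from coding sequences. The sketch below counts the nucleotide at the third position of each codon in an in-frame open reading frame; the function name and the toy sequence are hypothetical, and a genome-wide analysis would pool counts per codon family rather than across all codons, as this simplified version does.

```python
from collections import Counter


def third_position_counts(orf):
    """Count the nucleotide found at the third position of each codon."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return Counter(orf[i + 2] for i in range(0, len(orf), 3))


# Hypothetical toy ORF: ATG GCC AAG CTC TAA
toy_orf = "ATGGCCAAGCTCTAA"
counts = third_position_counts(toy_orf)
total = sum(counts.values())
for base in "CGTA":
    print(base, counts.get(base, 0) / total)
```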

We classified all predicted Neurospora transfer RNA genes and pre- 
dicted the relative translation elongation rate for each codon on the 
basis of tRNA-gene copy numbers, which correlate with tRNA abund- 
ance, and the nature of anticodon-codon interactions*!’. The most 
preferred codon for each amino acid is always the codon with highest 
predicted translation elongation rate (Supplementary Table 2). There- 
fore, to ensure efficient translation of abundant proteins, selection 


pressure favoured a bias for codons translated by highly abundant 
tRNA species. 

Many Neurospora genes exhibit little or no codon bias (Supplemen- 
tary Fig. 1a). FRQ is a low-abundance Neurospora protein. Its codon 
bias index'* (CBI; in which CBI=0 indicates completely random 
codon usage) value of 0.08 indicates that frq has little codon bias 
(Supplementary Fig. 1b). A codon-usage graph of the frq open reading 
frame (ORF) shows that many regions have non-optimal usage 
(Fig. 1a), whereas frh has good codon usage throughout its ORF. 
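
To make the index concrete, the sketch below computes a CBI using one common formulation, CBI = (N_pref − N_rand) / (N_tot − N_rand), where N_pref is the number of preferred codons used, N_tot the number of codons from amino acids with synonymous codons, and N_rand the number of preferred codons expected under uniform synonymous usage. Whether the value of 0.08 was obtained with exactly this formulation is not stated here. The preferred-codon set is a loud assumption: it is derived from the C>G>T>A third-position preference rather than from Supplementary Table 2, and ties are broken arbitrarily.

```python
# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def synonymous_families():
    """Map each amino acid with more than one codon to its codon list."""
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)
    return {aa: codons for aa, codons in families.items() if len(codons) > 1}


def heuristic_preferred(families, third_base_rank="CGTA"):
    """ASSUMPTION: one preferred codon per family, chosen by third base only."""
    return {
        aa: min(codons, key=lambda c: third_base_rank.index(c[2]))
        for aa, codons in families.items()
    }


def codon_bias_index(orf, preferred):
    """CBI = (N_pref - N_rand) / (N_tot - N_rand); 0 means random usage."""
    orf = orf.upper()
    families = synonymous_families()
    n_total = 0
    n_pref = 0
    n_rand = 0.0
    for i in range(0, len(orf) - len(orf) % 3, 3):
        aa = CODON_TABLE.get(orf[i:i + 3])
        if aa not in families:
            continue  # skips Met, Trp, stop codons and ambiguous codons
        n_total += 1
        n_rand += 1.0 / len(families[aa])
        if orf[i:i + 3] == preferred[aa]:
            n_pref += 1
    if n_total == 0:
        raise ValueError("no codons from synonymous families found")
    return (n_pref - n_rand) / (n_total - n_rand)


preferred = heuristic_preferred(synonymous_families())
cbi_optimal = codon_bias_index("GCCGACCTC", preferred)   # Ala, Asp, Leu
cbi_mixed = codon_bias_index("GCCGCAGACGAT", preferred)  # half preferred
cbi_poor = codon_bias_index("GCAGATCTA", preferred)      # none preferred
print(cbi_optimal, cbi_mixed, cbi_poor)
```

On this scale a gene that uses only preferred codons scores 1, and values near 0, such as the 0.08 reported for frq, indicate usage close to random.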

We created two constructs in which the amino-terminal end (amino 
acids 1-164) of frq was codon optimized. In the m-frq construct, only 
the non-preferred codons were changed, whereas every codon was 
optimized in the f-frq construct. Predicted stability of RNA secondary 
structure was not notably affected by the optimization (Supplementary 
Table 3). These constructs and the wild-type frq (wt-frq) gene were 
transformed individually into a frq null strain (frq10). Both m-frq and
f-frq strains have significantly higher levels of FRQ proteins in con- 
stant light than that of the wt-frq strain (Fig. 1b and Supplementary 
Fig. 2a). On the other hand, frq mRNA levels were comparable in these 
strains (Supplementary Fig. 2b). FRQ is known to upregulate WC 
protein levels'®’*. The WC-1 and WC-2 levels, however, were similar 
in these strains despite the much higher levels of FRQ in the optimized 
strains (Fig. 1b and Supplementary Fig. 2a, c). 

The wt-frq construct was able to fully rescue the arrhythmic
conidiation rhythm of the frq10 strain in constant darkness (Fig. 1c), but
both of the optimized frq strains exhibited arrhythmic conidiation 
phenotypes; these are not due to the modest changes in the ratios of 
two alternatively translated FRQ forms, as either form of FRQ alone 
can support robust rhythms’””®. 

We created two additional constructs (m1-frq and m2-frq), in which
only the amino- or carboxy-terminal segments of the optimized region
of m-frq, respectively, were optimized. The frq10 transformants
carrying either construct exhibited long-period conidiation rhythms and
had FRQ levels between those of wt-frq and m-frq strains (Fig. 1c and 
Supplementary Fig. 3a). These results indicate that the severe conidia- 
tion rhythm phenotypes of the m-frq and f-frq strains are due to the 
cumulative effect of codon optimization and are not likely due to 
mutation of a DNA or RNA element. 

To examine circadian phenotypes at the molecular level, we introduced 
a luciferase reporter construct that is under the control of the frq pro- 
moter’® into wt-frq, m-frq and f-frq strains. As shown in Fig. 1d and 
Supplementary Fig. 3b, the robust rhythmic luciferase activity was abo- 
lished in the optimized strains. FRQ protein levels also lost molecular 
rhythmicity in the optimized strains (Fig. 1e, f and Supplementary
Fig. 3c, d): the overall levels of FRQ were high and circadian changes in 
FRQ abundance and phosphorylation profile were abolished. In addition, 
FRQ stayed hyperphosphorylated in constant darkness in the optimized 
strains. Together, these results indicate that the non-optimal codon usage 
of frq governs FRQ expression level and is essential for clock function. 
Figure 1 | Codon optimization of frq results in high FRQ expression levels
and loss of circadian rhythmicities. a, Codon-usage score plots of frq and frh 
obtained using Codon Usage 3.5. CC, coiled-coil domain; FCD, FRQ-CK1 

interaction domain; FRQ-FRH, FRQ-FRH interaction domain; NLS, nuclear- 
localization signal; nt, nucleotide; PEST, proline (P), glutamic acid (E), serine 
(S) and threonine (T) domain. b, Western blot showing the levels of FRQ and 
WC proteins in wt-frq, m-frq and f-frq strains. Three independent cultures in 


The loss of circadian rhythms in the optimized strains is surpri- 
sing, as we previously showed that high FRQ levels do not result in 
arrhythmicity’”!, suggesting that codon optimization causes defects 
in FRQ function. FRQ fulfils its function in the circadian negative- 
feedback loop through its interaction with WC proteins’. Immuno- 
precipitation assays showed that the relative amounts of FRQ associated 
with WC-2 were significantly decreased in both optimized strains 
constant light at 25 °C were used. c, Race tube analysis showing the conidiation
phenotypes in constant darkness (DD). Black lines indicate the growth fronts
every 24 h. LL, constant light. d, Luciferase reporter assay showing frq promoter
activity of the indicated strains after 1 day in constant darkness. The 
measurement of luciferase activity was normalized to subtract the baseline 
luciferase signal. A.U., arbitrary units. e, f, Western blots showing loss of FRQ 
expression rhythms in the codon-optimized strains in DD. 


(Fig. 2a), suggesting that the FRQ function in the negative-feedback loop 
is impaired in the optimized strains. 

FRQ also acts in a positive-feedback loop by promoting WC protein 
expression’®™*. Constructs qa-m-frq and qa-f-frq, in which either 
m-frq or f-frq is under the control of a quinic-acid-inducible promoter, 
respectively, were introduced into the frq null strain. As expected, FRQ 
levels were higher in the qa-m-frq and qa-f-frq strains than the control 
Figure 2 | FRQ activities in circadian feedback loops are impaired in the frq
codon-optimized strains. a, Left, immunoprecipitation assay showing that FRQ
has a decreased ability to interact with WC-2 in the codon-optimized strains. Two
different exposures (exp.) of the WC-2 western blot are shown. Right,
densitometric analyses of results from four independent experiments. IP,
immunoprecipitation; PI, pre-immune serum. Error bars denote ± s.e.m. (n = 9).
**P < 0.01. b, c, Western blots showing the levels of WC proteins and FRQ in the
indicated strains at different concentrations of quinic acid (QA) in constant light.
qa-wt-frq strain at a given quinic acid concentration (Fig. 2b, c and
Supplementary Figs 4 and 5). Induction of FRQ resulted in increased 
levels of WC proteins, demonstrating the role of FRQ in the positive- 
feedback loop. At quinic acid concentrations higher than 1 X 10 * M, 
however, the qa-m-frq and qa-f-frq strains had lower levels of WC-1 
and WC-2 than the qa-wt-frq strain, despite having higher FRQ levels, 
indicating that FRQ function in the positive-feedback loop is also 
impaired in the codon-optimized strains. 

The fact that FRQ function is impaired in codon-optimized strains 
despite higher FRQ protein levels suggests that the structural con- 
formation of FRQ is altered. FRQ becomes progressively phosphory- 
lated before its degradation’. Figure 3a shows that, in both m-frq and 
f-frq strains, FRQ was less stable than in the wild-type strain after the 
addition of cycloheximide (CHX). The difference in FRQ stability 
was most pronounced after 3h of CHX treatment, a time when FRQ 
became hyperphosphorylated. 
FRQ from the optimized strains was also less stable in vitro when protein
extracts were subjected to freeze-thaw cycles. Although FRQ levels in
freshly isolated samples were significantly higher in optimized strains,
they decreased rapidly after freeze-thaw cycles (Fig. 3b). By contrast,
expression of wt-frq in a wc-2 knockout strain to a level that is
comparable to that of the optimized frq strains did not affect the freeze-
thaw sensitivity of FRQ (Supplementary Fig. 6), indicating that the 
change in FRQ stability in the optimized strains is not due to high FRQ 
level or its reduced ability to interact with WC proteins. Furthermore, 
limited trypsin digestion showed that full-length FRQ in either opti- 
mized strain was more sensitive to trypsin digestion than in the control 
strain (Fig. 3c). 

We reasoned that the changes in FRQ conformation in the opti- 
mized strains are due to an increase in the translation rate as a result of 
codon optimization. Thus, we examined whether FRQ in the f-frq
strain can be rescued by decreasing the protein-translation rate at
Figure 3 | FRQ protein in the codon-optimized strains is less stable and
more sensitive to trypsin digestion. a, Top, western blots showing FRQ
degradation after CHX treatment (10 µg ml⁻¹). A longer exposure for the wt-frq
strain was used so that the FRQ signals at time 0 are comparable between the
three strains. Bottom, densitometric analyses of results of four independent
experiments. b, c, Western blots showing sensitivity of FRQ from
codon-optimized strains to freeze-thaw (F-T) cycles (b) and trypsin (1 µg ml⁻¹)
digestion (c, top). A longer exposure for the wt-frq strain was used in
c. Densitometric analyses of FRQ levels of three independent experiments are
shown (c, bottom). d, e, Western blots showing that FRQ from the f-frq strain
grown at 18 °C is more resistant to freeze-thaw cycles (d; n = 2) and to trypsin
digestion (e; n = 4) than that grown at 25 °C. *P < 0.05, **P < 0.01. For d and
e, the densitometric analyses of the results are also shown. Error bars in
a, c, e denote ± s.d.
18 °C, as low temperature is known to reduce FRQ expression”!
(Fig. 4d). The low-temperature treatment was not successful in resto- 
ring the circadian conidiation rhythm of the f-frq strain (data not 
shown), which was not surprising because 18 °C is near the lower limit 
of temperature range permissive for rhythmicity’. FRQ in the f-frq 
strain at 18°C, however, is much less sensitive to freeze-thaw cycles 
and to trypsin digestion than at 25 °C (Fig. 3d, e). These results suggest 
that frq codon optimization changes FRQ structure as a consequence 
of increasing translation rate. 

To determine whether the codon-usage effect on FRQ is limited to 
the N terminus of FRQ, we created a mid-frq strain, in which the middle 
region (amino acids 185-530) of the frq ORF is optimized. This region 
contains two casein kinase 1 (CK1)-interaction domains and most of 
the phosphorylation sites that are important for FRQ stability and 
period determination”. Thus, changes of FRQ structural conforma- 
tion in this region should affect FRQ phosphorylation and stability. As 
with m-frq and f-frq strains, the mid-frq strain exhibited arrhythmic 
conidiation (Fig. 4a). In the mid-frq strain, FRQ levels were high, and 
FRQ levels and phosphorylation profile were not rhythmic (Fig. 4b, c). 
No significant difference in frq mRNA was observed (Supplementary 
Fig. 7). Notably, in the mid-frq strain, FRQ accumulated in hypopho- 
sphorylated forms, was more stable after CHX treatment and was 
more resistant to trypsin digestion (Fig. 4b-e). In addition, the CHX- 
triggered rapid hyperphosphorylation of FRQ in the wt-frq strain was 
abolished in the mid-frq strain*’. These results strongly indicate that 
the non-optimal codon usage of frq is important for FRQ structural 
Figure 4 | Codon optimization of the middle region of FRQ impairs FRQ
phosphorylation and stabilizes FRQ. a, Race tube analysis showing the 
conidiation phenotypes of indicated strains in constant darkness. b, Top, 
western blot showing FRQ expression profile in constant light. Bottom, 
densitometric analyses of three independent samples. c, Top, western blots 
showing FRQ expression profile in constant darkness. Bottom, densitometric 
analysis of the result. d, Top, western blot showing the degradation of FRQ after 
required for proper phosphorylation and stability. The opposite
molecular phenotypes of mid-frq and the N-terminal-optimized strains 
indicate that changes in FRQ structural conformation in these strains 
are due to location-specific codon optimization. 

In a companion study, codon optimization of kaiBC clock genes in 
cyanobacteria results in high Kai protein levels and impaired cell growth 
at cool temperatures”, suggesting that non-optimal codon usage is a 
shared adaptive mechanism in both prokaryotes and eukaryotes. 

Our study suggests that codon usage regulates not only protein- 
expression level but also its structure and function. Protein folding, 
which occurs cotranslationally, requires protein chaperones and suf- 
ficient amounts of time’’. Codon optimization results in increased 
translation rates and thus reduces the time available for folding. 
Bioinformatics analyses and heterologous protein-expression studies 
previously implicated codon usage as a factor that regulates protein 
folding” **. In addition, a single rare codon in the human MDR1 
(also known as ABCB1) gene results in altered drug and inhibitor 
interaction”. 

In sharp contrast with the cyanobacterial Kai proteins”, most of the 
FRQ protein is predicted to be disordered (Supplementary Fig. 8). 
Interestingly, the known domains of FRQ all have relatively low dis- 
order tendencies and fall in regions where codon-usage scores are 
relatively high, suggesting that a fast translation rate in these well- 
structured regions does not interfere with protein folding. On the other 
hand, although the disordered regions may not form stable structures 
by themselves, they are critical for proper FRQ phosphorylation and 
stability***. They may serve as platforms for inter- or intramolecular 
protein-protein interactions, which may require more time for protein 
folding than well-structured domains. Therefore, frq non-optimal 
codon usage should be a mechanism that allows the proper folding 
of FRQ by reducing translation rate in these predicted disordered 
regions. 


METHODS SUMMARY 


Codons were optimized on the basis of Neurospora codon-usage frequency. 
Codon-usage score plot was obtained using software Codon Usage 3.5 (developed 
by Conrad Halling). CBI, frequency of optimal codons (Fop), ENC (effective 
number of codons) and GC (guanine-cytosine) content values were calculated 
by codonw (http://mobyle.pasteur.fr/cgi-bin/portal.py#forms::codonw). Optimized 
frq constructs were transformed into a bd, frq^10, his-3 strain. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 

Strains and growth conditions. Neurospora strains used in this study were 87-3 
(bd a; clock wild-type), 303-3 (bd, frq^10, his-3)*! and different frq N-terminal-
codon-optimized strains created in this study. Strain 303-3 was used as the host 
strain for various his-3-targeting constructs. The frq^10, bd, wt-frq (frq^10 containing 
a wild-type frq gene at the his-3 locus) strain was used as the control in this study. 

Growth conditions have been described previously”. Liquid cultures were 
grown in minimal medium (1× Vogel’s, 2% glucose). When quinic acid was used, 
liquid cultures were grown in (10 °-10 7M) quinic acid, pH 5.8, 1× Vogel’s, 
0.1% glucose and 0.17% arginine. Race tube media contained 1× Vogel’s, 0.1% 
glucose, 0.17% arginine, 50 ng ml⁻¹ biotin and 1.5% agar. For rhythmic experiments, 
the Neurospora cultures were transferred from LL to DD at time 0 and were 
collected in DD at the indicated time (hours). Calculations of period length were 
performed as described”. 
frq codon optimization, codon-usage score plot and indices calculation. frq 
codon optimization was performed for the N-terminal part (1-498 nucleotides) or 
the middle region (553-1,590 nucleotides) of the ORF. The nucleotide sequences 
of the optimized frq are shown in Supplementary Figs 9 and 10. Codons were 
optimized on the basis of N. crassa codon-usage frequency and the predicted most 
efficient codon on the basis of tRNA copy numbers. In total, 65 codons were 
optimized in the m-frq construct, whereas 94 codons were optimized in the 
f-frq construct. Sequences surrounding an alternative frq 3’ splice site in this 
region were not mutated. 

Codon-usage score plot was obtained using Codon Usage 3.5 (developed by 
Conrad Halling) using a window size of 35 and logarithmic range of 3. The N. crassa 
codon-usage frequency table was obtained from http://www.kazusa.or.jp/codon/. To 
calculate the CBI, frequency of optimal codons (Fop), ENC (effective number of 
codons) and GC (guanine-cytosine) content, codonw in the Mobyle Portal website 
(http://mobyle.pasteur.fr/cgi-bin/portal.py#forms::codonw) was used'**’. CBI will 
equal 1.0 if a gene has extreme codon bias and will equal 0 if codon usage is 
completely random. If the number of optimal codons is less than expected by 
random chance, the CBI value will be negative. Neurospora genome 
sequences were downloaded from the Broad Institute N. crassa database 
(http://www.broadinstitute.org/annotation/genome/neurospora/MultiHome.html). The 
top 100 most abundant proteins were identified by mass spectrometry analyses 
and ranked by their emPAI (exponentially modified protein abundance index) 
values** (Supplementary Table 1). 
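
The behaviour of the CBI described above (1.0 for extreme bias, 0 for random usage, negative when optimal codons are rarer than expected) can be illustrated with a short sketch. It assumes the commonly used form CBI = (N_opt - N_rand)/(N_tot - N_rand), where N_rand is the number of optimal codons expected under uniform synonymous usage. The three-amino-acid codon table, the choice of "optimal" codons and the function name are hypothetical and serve only as an illustration; they are not the codonw implementation or the Neurospora codon table.

```python
from collections import Counter

# Toy subset of the genetic code (illustrative only): amino acid -> synonymous codons.
SYNONYMS = {
    "Ala": ["GCT", "GCC", "GCA", "GCG"],
    "Lys": ["AAA", "AAG"],
    "Glu": ["GAA", "GAG"],
}
# Hypothetical optimal codons; a real analysis would take these from the organism.
OPTIMAL = {"GCC", "AAG", "GAG"}

CODON_TO_AA = {c: aa for aa, codons in SYNONYMS.items() for c in codons}


def codon_bias_index(seq):
    """Return (N_opt - N_rand) / (N_tot - N_rand) for an in-frame coding sequence."""
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    codons = [c for c in codons if c in CODON_TO_AA]  # ignore codons outside the toy table
    n_tot = len(codons)
    n_opt = sum(c in OPTIMAL for c in codons)
    aa_counts = Counter(CODON_TO_AA[c] for c in codons)
    # Expected number of optimal codons if synonymous codons were used uniformly.
    n_rand = sum(
        n * sum(c in OPTIMAL for c in SYNONYMS[aa]) / len(SYNONYMS[aa])
        for aa, n in aa_counts.items()
    )
    return (n_opt - n_rand) / (n_tot - n_rand)


if __name__ == "__main__":
    print(codon_bias_index("GCCAAGGAG"))  # only optimal codons
    print(codon_bias_index("AAAAAG"))     # optimal codons at the random expectation
    print(codon_bias_index("GCTAAAGAA"))  # no optimal codons
```

The three example sequences correspond to the three regimes mentioned in the text: fully biased, random-like and depleted in optimal codons.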

Plasmid constructs and Neurospora transformation. pKAJ120 (containing the 
entire wild-type frq gene including its promoter and a his-3-targeting sequence) 
and pBA50 (containing the wild-type frq gene under the control of the qa-2 
promoter and a his-3-targeting sequence) were used as the parental plasmids to 
create the optimized frq constructs”. Optimized frq sequences (synthesized by 
Genscript) were subcloned into parental plasmids to replace the wild-type frq gene 
to generate the m-frq, f-frq and mid-frq constructs. In the m1-frq construct, only 
the codons upstream of the predicted intron branch point were optimized as 
m-frq. For the m2-frq construct, only the codons downstream of the intron 3’ 
end were optimized as m-frq. The resulting constructs were transformed into 
strain 303-3 by electroporation and targeted to the his-3 locus*. Homokaryon 
strains were obtained by microconidia purification. 

Protein and RNA analyses. Protein extraction, western blot analysis and immu- 
noprecipitation assays were performed as previously described****. Equal amounts 
of total protein (50 µg) were loaded into each lane of SDS-PAGE gels (7.5% 
SDS-PAGE gels containing a ratio of 37.5:1 acrylamide/bisacrylamide). Densitometry of 
the signal was performed by using Image J. 

RNA extraction and quantitative reverse transcriptase PCR (qRT-PCR) were 
performed as previously described’. For qRT-PCR, the primer sequences used 
for frq were 5'-GGAGGAGTCGATGTCACAAGG-3' (forward) and 
5'-CACTTCGAGTTACCCATGTTGC-3' (reverse). The Neurospora gene coding for β-tubulin 
was used as an internal control. The primer sequences specific for β-tubulin were 
5'-GCGTATCGGCGAGCAGTT-3' (forward) and 
5'-CCTCACCAGTGTACCAATGCA-3' (reverse). 

frq mRNA secondary structure and folding energy was predicted by the mfold 
program (http://mfold.rna.albany.edu/?q=mfold/RNA-Folding-Form). 

Mass spectrometric analyses and database search. The Neurospora proteins 
were separated on a 4-20% SDS-PAGE gradient gel. The whole-protein gel lane 
was sliced equally into 14 gel blocks from top to bottom. Each gel block was 
destained and then digested in-gel with sequencing grade trypsin (10 ng µl⁻¹ trypsin 
in 50 mM ammonium bicarbonate, pH 8.0) at 37 °C overnight. The resulting 
tryptic peptides from each gel block were extracted with 5% formic acid/50% 
acetonitrile and 0.1% formic acid/75% acetonitrile sequentially and then 
concentrated to ~20 µl in a CentriVap system (Labconco). The extracted peptides from 
each sample were separated by a home-made analytical capillary column (50 µm 
× 10 cm) packed with 5-µm spherical C18 reversed-phase material (YMC). An 
Agilent 1100 binary pump was used to generate a high-performance liquid 
chromatography gradient as follows: 0-5% B in 5 min, 5-40% B in 55 min, 40-100% B 
in 15 min (A = 0.2 M acetic acid in water; B = 0.2 M acetic acid/70% acetonitrile). 
The eluted peptides were sprayed directly into a LTQ mass spectrometer (Thermo 
Fisher Scientific) equipped with a nano-ESI ion source. The mass spectrometer 
was operated in data-dependent mode (MS scan mass range was 350-2,000 Da, the 
top-five most abundant precursor ions from each MS scan were selected for MS- 
MS scans, and dynamic exclusion time was 30s). The mass spectrometric data 
from all 14 samples were combined and searched against Neurospora protein 
database on an in-house Mascot server (Matrix Science). The main search para- 
meters were as below: protein N-terminal acetylation and methionine oxidation 
were included as variable modifications; two missed cleavage sites were allowed; 
precursor ion mass tolerance was set as 3 Da; fragment ion mass tolerance was 
0.8 Da. Only peptides with an E value below 0.01 were retained. The emPAI* was 
calculated for each protein by the Mascot software. 
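
For orientation, the emPAI ranking can be sketched as below. The sketch assumes the usual definition, emPAI = 10^(N_observed/N_observable) - 1, where N_observed is the number of peptides detected for a protein and N_observable is the number theoretically detectable. The protein names and peptide counts are invented for illustration, and the way Mascot derives the observable count is not reproduced here.

```python
def empai(n_observed, n_observable):
    """Exponentially modified protein abundance index: 10**(observed/observable) - 1."""
    if n_observable <= 0:
        raise ValueError("n_observable must be positive")
    return 10 ** (n_observed / n_observable) - 1


def rank_by_empai(proteins):
    """Sort protein names by decreasing emPAI.

    `proteins` maps a name to a (n_observed, n_observable) tuple.
    """
    return sorted(proteins, key=lambda name: empai(*proteins[name]), reverse=True)


if __name__ == "__main__":
    # Hypothetical peptide counts, for illustration only.
    example = {"protein_A": (12, 20), "protein_B": (3, 25), "protein_C": (18, 20)}
    for name in rank_by_empai(example):
        print(name, round(empai(*example[name]), 2))
```

Because the index grows exponentially with the fraction of observable peptides detected, it separates abundant proteins more strongly than a simple peptide count would.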

Luciferase reporter assay. The luciferase reporter construct (frq-luc-bar) was 
generated by insertion of the 4.7-kb BamHI-NotI fragment of pfrq-luc-I (a generous 
gift from J. Dunlap)'* into the corresponding sites of pBARKS1. This construct, 
which contains the luciferase gene under the control of the frq promoter and the bar 
gene, was transformed into 87-3 (bd a), wt-frq (frq^10, bd, wt-frq), m-frq (frq^10, bd, 
m-frq) and f-frq (frq^10, bd, f-frq) strains. Transformants were selected using the 
basta/ignite (200 µg ml⁻¹) resistance conferred by the bar gene. 

LumiCycle (Actimetrics) was used for the luciferase assay using a protocol 
similar to that described previously'®. The AFV (autoclaved FGS-Vogel’s) medium 
contained 1× FGS (0.05% fructose, 0.05% glucose, 2% sorbose), 1× Vogel’s medium, 
50 µg l⁻¹ biotin and 1.8% agar. Firefly luciferin (L-8200 D-luciferin firefly 
(synthetic) potassium salt; BioSynth) was added to the medium after autoclaving (final 
concentration 50 µM). One drop of conidial suspension in water was placed on 
AFV medium and grown in LL overnight. The cultures were then transferred to 
darkness, and luminescence was recorded in real time in DD using a LumiCycle after 
1 day in DD. The data were then normalized with LumiCycle analysis software by 
subtracting the baseline luciferase signal, which increases as cells grow. Therefore, 
the normalized luciferase signals reflect the amplitude of the rhythm and do not 
reflect the absolute luciferase signal. Under our experimental conditions, luciferase 
signals were highly variable during the first day in the LumiCycle and became 
stabilized afterwards, which is probably because of an artefact caused by the 
light-dark transfer of the cultures. Thus, the results presented were recorded after 
1 day in DD. 

Protein-stability assay. The liquid cultures of Neurospora strains were grown in 
LL for 1 day before the addition of CHX (final concentration 10 µg ml⁻¹). Cells 
were collected at the indicated time points. 

For the trypsin-sensitivity assay, protein extracts were diluted to a protein 
concentration of 2.5 µg µl⁻¹. Extracts (100 µl) were treated with trypsin (final 
concentration 1 µg ml⁻¹) at 25 °C. A 20-µl sample was taken from the reaction at each 
time point (0, 5, 15 and 30 min) after addition of trypsin. Protein samples were 
mixed with protein loading buffer and resolved by SDS-PAGE. To compare 
trypsin sensitivity of FRQ from different strains, experiments were performed side 
by side and the protein samples were transferred to the same membrane for 
western blot analysis. 
Non-optimal codon usage is a mechanism to achieve 
circadian clock conditionality 


Yao Xu, Peijun Ma, Premal Shah, Antonis Rokas, Yi Liu & Carl Hirschie Johnson 


Circadian rhythms are oscillations in biological processes that 
function as a key adaptation to the daily rhythms of most environ- 
ments. In the model cyanobacterial circadian clock system, the core 
oscillator proteins are encoded by the gene cluster kaiABC'. Genes 
with high expression and functional importance, such as the kai 
genes, are usually encoded by optimal codons, yet the codon-usage 
bias of the kaiBC genes is not optimized for translational efficiency. 
We discovered a relationship between codon usage and a general 
property of circadian rhythms called conditionality; namely, that 
endogenous rhythmicity is robustly expressed under some envir- 
onmental conditions but not others’. Despite the generality of 
circadian conditionality, however, its molecular basis is unknown 
for any system. Here we show that in the cyanobacterium 
Synechococcus elongatus, non-optimal codon usage was selected as a 
post-transcriptional mechanism to switch between circadian and 
non-circadian regulation of gene expression as an adaptive response 
to environmental conditions. When the kaiBC sequence was experi- 
mentally optimized to enhance expression of the KaiB and KaiC 
proteins, intrinsic rhythmicity was enhanced at cool temperatures 
that are experienced by this organism in its natural habitat. 
However, fitness at those temperatures was highest in cells in which 
the endogenous rhythms were suppressed at cool temperatures as 
compared with cells exhibiting high-amplitude rhythmicity. These 
results indicate natural selection against circadian systems in cya- 
nobacteria that are intrinsically robust at cool temperatures. 
Modulation of circadian amplitude is therefore crucial to its adap- 
tive significance’. Moreover, these results show the direct effects of 
codon usage on a complex phenotype and organismal fitness. Our 
work also challenges the long-standing view of directional selection 
towards optimal codons*”’, and provides a key example of natural 
selection against optimal codons to achieve adaptive responses to 
environmental changes. 

Most amino acids are encoded by several codons, and species vary in 
their preferences for specific codons for the same amino acid. This 
preference, or codon-usage bias, is thought to reflect a balance between 
mutational biases and selection for translational efficiency and accu- 
racy**°. Although the relative importance of various factors affecting 
codon usage is debated'*"’, the degree of codon-usage bias is known 
to increase with higher gene expression’’. In particular, genes under 
stronger selection for translational efficiency and/or accuracy show 
greater preference for using codons that have complementary transfer 
RNAs with higher abundances. Despite the fact that codon-usage bias 
is a strong indicator of selection on genes’’ and that the specific nature 
of codon bias can be changed by environmental factors such as tem- 
perature and hypersalinity’*”*, its direct effect on a complex phenotype 
and organismal fitness remains largely unknown. 

In the cyanobacterium S. elongatus PCC 7942, circadian rhythms 
confer a strong selective advantage in rhythmic environments*’®. 
Expression of the entire S. elongatus genome is controlled by the 
circadian pacemaker, as shown by rhythms of promoter activity’, 
messenger RNA abundance'*”, and the topology of the entire 


chromosome’””’. The core circadian clock is composed of three com- 
ponents, KaiA, KaiB and KaiC, that are expressed as monocistronic 
kaiA and dicistronic kaiBC transcripts’. Both transcripts are expressed 
at very high levels, falling within the top 5% of mRNA abundances in 
S. elongatus'*". Whereas initial examination of codon usage in kaiBC 
suggested an unusual codon bias (Supplementary Table 1 and Fig. 1a), 
further analyses indicated that amino acid-specific codon usage in kaiB 
and kaiC is not significantly different from that of the average codon 
usage in the rest of the genome (Fig. 1b). To quantify the degree of 
selection on the kai gene cluster, we focused on its codon adaptation 
index (CAI)** and the 5’ mRNA folding energies of its transcripts. 
Given that kaiBC transcripts are highly abundant, it was surprising 
that the CAI for kaiB and kaiC is less than the average CAI value of all 
S. elongatus genes (Fig. 1b). Moreover, even though the folding energy 
of the kaiB transcript is less negative than that of most transcripts 
(Fig. 1c)—indicating that it has a relatively weaker secondary structure 
than most mRNAs in this organism and is therefore likely to initiate 
translation efficiently—the weak secondary structure of the kaiB trans- 
cript is insufficient to account for its high mRNA abundance given that 
the CAI is considerably below average. The kaiC portion of the kaiBC 
transcript has similar CAI (Fig. 1b) and folding energy (Fig. 1c) values 
to those of the kaiB portion. 

To test whether the lower CAI of kaiBC might be adaptive in the 
core clock mechanism and/or in the output pathways controlled by 
the pacemaker in cyanobacteria, we generated two strains in which the 
endogenous kaiBC gene was replaced with modified versions of kaiBC 
where codon usage was ‘optimized’ to be similar to that of highly 
expressed genes'*"’, thereby increasing their CAI values from 0.60 to 
0.95 for kaiB and 0.61 to 0.82 for kaiC (Fig. 1a and Supplementary 
Tables 2 and 3). Changing the codon usage of a gene in its 5’ region 
affects its folding energy and hence its rate of translation initiation**7™. 
Indeed, the optimized versions of kaiB and kaiBC also had signifi- 
cantly lower 5’ folding energy than the kaiBC transcript (Fig. 1c, d). 
Because the kaiBC gene is transcribed as a single dicistronic mRNA 
with kaiB at its 5’ end’, one optimized strain (optKaiB) replaced the 
entire wild-type kaiB^WT gene with an optimized kaiB^opt gene, whereas 
the other optimized strain (optKaiBC) replaced both the kaiB^WT gene 
and the 5’ half of the kaiC^WT gene with optimized versions (kaiB^opt 
and kaiC^opt; Fig. 2a). Our initial hypothesis was that the ‘non-optimal’ 
codon bias of the endogenous kaiBC gene is essential for the expres- 
sion of circadian rhythmicity, which would predict that the intrinsic 
rhythmicity of the optKaiB and optKaiBC strains would be poorer 
than that of wild type (as is true for the studies of the Neurospora 
FRQ protein”). Unexpectedly, the observed circadian rhythms of gene 
expression in the optKaiB and optKaiBC strains were as robust as 
those of the wild-type S. elongatus at the optimal growth temperature 
of 30 °C (Fig. 2b). However, to be adaptive, circadian clocks must be 
able to keep time accurately over the range of physiological tempera- 
tures for a given organism”®, so we tested real-time gene expression 
profiles of the optimized strains relative to wild type at tempera- 
tures from 18°C to 38°C. Again, to our surprise the optKaiB and 
Figure 1 | Non-optimal codon usage of cyanobacterial clock genes. a, CAI 
values along the entire length of different genes at a sliding window of 20 
codons. Left, kaiB^WT versus kaiB^opt. Middle, KaiC-I domain (encoded by the 5’ 
half of the kaiC gene) of kaiC^WT versus kaiC^opt. Right, average of all ribosomal 
protein genes. b, Comparison of CAI values of the kaiB^WT, kaiB^opt, kaiC^WT, 


optKaiBC strains exhibited robustly rhythmic gene expression over a 
broad range of temperatures (Fig. 2b and Supplementary Table 4). By 
contrast, the rhythm of the wild-type strain damps within a few cycles 
kaiC^opt and ribosomal genes in the CAI histogram distribution of the genome. 
c, Calculated 5’ folding energy of the mRNA for all genes in the S. elongatus 
genome. d, Comparison of calculated minimum free energy of folding (ΔG) 
over a range of temperatures between kaiB^WT and kaiB^opt. 


at cool temperatures (18-23 °C; Fig. 2b and Supplementary Table 4). 
There are no significant differences in the free-running period of 
the circadian rhythms between wild-type and optKaiB/optKaiBC at 
18-20 °C and 26-32 °C, and only small differences at other temperatures 
in the 18-38 °C range (Fig. 2c). Overall, the temperature compensation 
of the optimized strains was slightly poorer (Q10 ≈ 1.13) 
over the range of 23-36 °C than that for wild type (Q10 ≈ 1.10 from ref. 
27, and Q10 ≈ 1.04 from Fig. 2c and Supplementary Table 6), which 
might relate to the difference in mRNA folding energies (Fig. 1d). 
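
As a worked illustration of how a Q10 value of this kind is obtained, the sketch below assumes the conventional definition in which the rate of the oscillator is the reciprocal of the free-running period, so that Q10 = (period at the lower temperature / period at the higher temperature)^(10/ΔT). A Q10 of 1.0 corresponds to perfect temperature compensation. The function name and the example periods are hypothetical and are not values taken from Fig. 2c.

```python
def q10_from_periods(period_low_h, temp_low_c, period_high_h, temp_high_c):
    """Q10 of a rhythm from free-running periods measured at two temperatures.

    Rate is taken as 1/period, so the rate ratio (high/low temperature)
    equals period_low / period_high.
    """
    if temp_high_c == temp_low_c:
        raise ValueError("the two temperatures must differ")
    rate_ratio = period_low_h / period_high_h
    return rate_ratio ** (10.0 / (temp_high_c - temp_low_c))


if __name__ == "__main__":
    # Hypothetical periods (hours) at 23 °C and 36 °C, for illustration only.
    print(round(q10_from_periods(26.0, 23.0, 24.0, 36.0), 3))
```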
We further confirmed that improved rhythmicity at lower tempera- 
tures of the optKaiB and optKaiBC strains (Fig. 2b) was due to higher 
protein production levels of KaiB and KaiC. First, we confirmed that 
KaiC levels are considerably higher in the optKaiBC strain than in the 
wild type (Fig. 3a and Supplementary Fig. 1), and showed that KaiB 
levels are similarly increased in both the optKaiB and optKaiBC strains 
(Fig. 3b and Supplementary Fig. 2). Interestingly, the amplitude of the 
KaiC phosphorylation rhythm is comparable between wild type and 
optKaiBC at 20°C (Fig. 3a). Second, to validate that the augmented 
KaiB and/or KaiC levels were responsible for the improved rhythmi- 
city at cool temperatures, we co-expressed the native (non-optimized) 
sequences of kaiB (strain KaiB^WT/OX) or kaiBC (strain KaiBC^WT/OX) 
from an isopropylthiogalactoside (IPTG)-inducible trc promoter 
to enhance endogenous production of KaiB and KaiC (Fig. 3c). 
Figure 3d shows that KaiB is overexpressed in the KaiB^WT/OX and 


Figure 2 | Conditional circadian phenotypes of the kai-optimized strains. 
a, Diagrams of genes in the wild-type kaiABC (WT), kaiB-optimized (optKaiB) 
and kaiBC-optimized (optKaiBC) strains (see Methods for a detailed 
description). Cm^r, chloramphenicol resistance; kaiBCp, kaiBC promoter; 
luxAB, reporter cassette containing the luxA and luxB genes; NS II, neutral site 
II; Spec^r, spectinomycin resistance. b, Luminescence rhythms of wild-type and 
kai-optimized strains in constant light at the indicated temperatures. In vivo 
luminescence rhythms were monitored from a group of 12 colonies for each 
strain, and a representative example is shown for each group. c, Free-running 
periods of luminescence rhythms in constant light from different strains over a 
temperature range of 18-38 °C. Data are mean ± s.e.m. 
KaiBC^WT/OX strains, even in the absence of the IPTG inducer (owing 
to the trc promoter being slightly leaky”*), whereas KaiC levels are not 
altered. In the presence of a very low concentration of the inducer 
(5 µM IPTG), KaiB levels are enhanced in KaiB^WT/OX, and both 
KaiB and KaiC levels are increased in KaiBC^WT/OX (Fig. 3d). KaiA 
levels are not substantially affected under any of these conditions. In 
rhythm assays at 18°C, optKaiB and optKaiBC exhibit improved 
rhythmicities compared with wild type, as noted above, but so does 
the KaiBC^WT/OX strain in the absence and presence of the inducer 
IPTG (Fig. 3e and Supplementary Table 5). (Much higher concen- 
trations of IPTG lead to arhythmicity as noted previously’; see 
Supplementary Fig. 3.) Moreover, although the data depicted in 
Fig. 3e are normalized luminescence data, unnormalized data for an 
equivalent experiment at 20°C are shown in Supplementary Fig. 4. 
Conversely, the KaiB^WT/OX strain rhythm damps rapidly. There is a 
clear correlation between the strains that exhibit sustained rhythmicity 
in constant conditions (an indicator of the endogenous circadian 
system) and a favourable KaiB abundance relative to KaiC abundance 
(optKaiB, optKaiBC and KaiBC^WT/OX) versus the strains in which 
rhythmicity damps rapidly and that express either a low (wild type) or 
a high (KaiB^WT/OX) KaiB level relative to the KaiC level (Fig. 3d, e). 
These experimental manipulations of KaiB and KaiC levels using non- 
optimized sequences strongly suggest that codon optimization of kaiB 
and kaiC affects the robustness of rhythmicity at cool temperatures 


primarily by affecting KaiB and KaiC expression, thereby altering the 
relative levels of KaiB, KaiC and KaiA protein abundances in the cells. 
The stoichiometry among the Kai proteins is known to be crucial for 
expression of the cyanobacterial in vitro oscillator’, and it is therefore 
likely to be a key determining factor in the expression of rhythmicity in 
vivo that we report here. 

If alternative (that is, ‘optimal’) kaiBC sequences promote rhythmicity 
at cooler temperatures, why have they not been naturally selected? 
Although 30°C is the optimal growth temperature for S. elongatus 
(Supplementary Fig. 5), 18-23 °C is certainly a temperature range that 
this freshwater, temperate cyanobacterium could experience in its 
environment. We therefore tested the growth rates of S. elongatus in 
12-h light/12-h dark cycles at 37, 34, 30, 25, 20 and 18 °C (Fig. 4 and 
Supplementary Fig. 5). Consistent with our previous results using 
competition assays between wild type and the arhythmic strain 
CLAb, as well as the highly damped strain CLAc’ (both CLAb and 
CLAc result from point mutations in the kaiC gene’), the wild-type 
strain grew at a faster rate in light-dark at 30°C than the CLAb or 
CLAc strains (Fig. 4b and Supplementary Table 7). The optKaiBC 
strain grew at about the same rate as wild type or perhaps slightly 
faster. At cooler temperatures, however, the results were markedly 
different. At 18°C and 20°C when the circadian rhythm of gene 
expression is damped in wild type (Figs 2b, 3e and Supplementary 
Fig. 4), the wild-type strain grew significantly faster than optKaiBC 
Figure 3 | Kai protein expression and circadian regulation of cells 
expressing wild-type or optimized versions of kaiBC. a, KaiC immunoblots 
in wild-type and optKaiBC strains in constant light (LL) at 20 °C. Densitometry 
reveals the KaiC phosphorylation profiles (right; hyper-P, 
hyperphosphorylated); the KaiC abundance patterns (bottom left, relative to 
the maximum density band, optKaiBC at 42h constant light); and the total 
KaiC abundance at three temperatures (20, 25 and 30 °C; bottom right and 
Supplementary Fig. 1) in the optKaiBC strain from 24-54 h constant light 
relative to wild type. b, KaiB immunoblots in wild-type, optKaiB and optKaiBC 
strains in constant light at 20 °C. A nonspecific band (NB) was used as an 
internal reference for KaiB density. Densitometry reveals relative KaiB 
abundance (ratio of KaiB density to the nonspecific band, bottom left) and total 
KaiB abundance at three temperatures (20, 25 and 30 °C; bottom right and 
Supplementary Fig. 2) in the optKaiB and optKaiBC strains from 24-54 h 


118 | NATURE | VOL 495 | 7 MARCH 2013 


constant light relative to wild type. c, Diagrams of the kaiB^WT- or 
kaiBC^WT-co-expressing strains. Km^r, kanamycin resistance; trcp, trc promoter. d, Increase of 
KaiB and/or KaiC levels in kaiB^WT- or kaiBC^WT-co-expressing strains 
(KaiB^WT/OX and KaiBC^WT/OX) at 12 h constant light with/without 5 µM 
IPTG. Left panel shows immunoblot assays for KaiB, KaiC and KaiA; equal 
loading was confirmed by the nonspecific band. Right panel depicts the 
densitometry of relative KaiA, KaiB and KaiC protein abundances. 
e, Phenocopying of the cool-temperature rhythmicity of the kaiB- or 
kaiBC-optimized strains in the wild-type strain by increased expression of the kaiB^WT 
and kaiC^WT genes. Luminescence was recorded in constant light at 18 °C 
from cultures of wild-type, codon-optimized (optKaiB and optKaiBC), 
and kai-co-expressing (KaiB^WT/OX and KaiBC^WT/OX) strains in the presence 
or absence of IPTG (5 µM). Representative traces are shown for each case. 
Figure 4 | Optimizing the kaiBC sequence causes slower growth rate at cool 
temperatures. a-e, Wild-type, optKaiBC, CLAb (arhythmic) and CLAc 
(damped oscillation) strains were grown in 12-h light/12-h dark cycles at 37 °C 
(a), 30°C (b), 25 °C (c), 20 °C (d) or 18 °C (e), with constant air bubbling and 
shaking. Cell densities were monitored by measuring attenuance at 750 nm 


(Fig. 4d, e and Supplementary Table 7). Even the arhythmic CLAb and 
damped CLAc strains outperformed optKaiBC at 18-25 °C (Fig. 4c-e 
and Supplementary Table 7). Therefore, at cool temperatures in light- 
dark cycles, strains having damped (wild type and CLAc) or arhythmic 
(CLAb) phenotypes under free-running conditions outgrew the strain 
that expressed robust rhythms (optKaiBC). 
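
Growth-rate comparisons of this kind are often summarized as doubling times. The sketch below shows one common way to estimate a doubling time from cell-density readings, assuming exponential growth and a least-squares fit of ln(density) against time. The procedure actually used for Supplementary Table 7 is not described in this section, so the fitting approach, the function name and the example readings are assumptions for illustration.

```python
import math


def doubling_time(times_h, densities):
    """Estimate doubling time (hours) from density readings during exponential growth.

    Fits ln(density) = intercept + slope * time by ordinary least squares and
    returns ln(2) / slope.
    """
    if len(times_h) != len(densities) or len(times_h) < 2:
        raise ValueError("need at least two paired time/density readings")
    logs = [math.log(d) for d in densities]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_l = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(times_h, logs))
    slope = sxy / sxx  # specific growth rate per hour
    if slope <= 0:
        raise ValueError("densities do not increase over time")
    return math.log(2) / slope


if __name__ == "__main__":
    # Hypothetical attenuance readings at 750 nm, for illustration only.
    hours = [0, 24, 48, 72]
    od750 = [0.05, 0.09, 0.17, 0.31]
    print(round(doubling_time(hours, od750), 1))
```

A longer doubling time under this fit corresponds to slower growth, which is how a strain such as optKaiBC would be distinguished from wild type at cool temperatures.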

Biases in codon usage are generally thought to be under directional 
selection for an optimal balance between translational efficiency 
and accuracy, in which a higher CAI is always better*’. Other examples 
of selection for non-optimal codon usage (mediated by mRNA 
secondary structure and/or tRNA availability) are rare and poorly 
characterized**’. Our study presents a counter-example to the stan- 
dard view, and suggests the action of either selection against optimal 
codon usage, or stabilizing selection in which both low and high 
extremes in codon-usage bias have higher fitness costs. Indeed, the 
non-optimal codon usage seems to be a molecular mechanism in 
which post-transcriptional events allow S. elongatus cells to switch 
between circadian and non-circadian regulation of gene expression 
depending on the environmental conditions, and our results are remi- 
niscent of earlier observations of such ‘conditionality’ for circadian 
rhythms*”’. 

Why is conditionality an important characteristic of circadian 
rhythms? We tend to think of circadian clocks as rhythmic activators, 
but they are also rhythmic repressors. Under some conditions this 
repressor/activator balance may inhibit growth and circadian regu- 
lation may not be adaptive. At cooler temperatures, the free-running 
period of the circadian rhythm of S. elongatus can be as long as 30h 
(Fig. 2c), suggesting that the coupling of gene expression and rhythmic 
regulation might be maladaptive for growth at lower temperatures. For 
example, we previously reported that S. elongatus mutant strains with 
30h period lengths can entrain to 24 h light-dark cycles, but they do so 
with a considerably later phase relationship that is maladaptive’’; a 
result that fits with our current observations. As another example of 
conditionality, our previous competition experiments at 30 °C found 
that the arhythmic CLAb strain is more fit than wild type under 
constant illumination, but is rapidly outcompeted by wild type in 
light-dark cycles’ (Fig. 4 shows that the growth of CLAb is poorer 
than wild type in light-dark cycles at 20-30 °C). Therefore, both illu- 
mination and temperature are environmental parameters that demon- 
strate the ‘conditional’ advantages of circadian regulation under some 
conditions but not others. 

Our data show that optimizing the codon usage for the circadian 
kaiBC genes of S. elongatus does not disrupt endogenous circadian 
regulation (as in the case of the Neurospora FRQ protein’’), but instead 
the sequence optimization enhances circadian regulation in a range of 
cool temperatures that are relevant for the ecology of this organism. 
Despite this enhancement of intrinsic rhythmicity, however, optimal 
codon usage at the kai locus impairs cell growth at cooler tempera- 
tures. Therefore, our data suggest selection against an optimal codon 
doubling time calculations, see Supplementary Table 7 and Supplementary 
Fig. 5.) 


usage because it is incompatible with a post-transcriptionally modu- 
lated conditional suppression of circadian rhythmicity at cool tem- 
peratures. In S. elongatus, circadian conditionality allows expression of 
robust endogenous rhythmicity in the range of temperatures that per- 
mit vigorous growth and suppress rhythmicity at temperatures in 
which growth of this species is minimal (Supplementary Fig. 5). Our 
observations provide a new example of post-transcriptional regulation 
of circadian clock genes that confers an adaptive response to different 
environmental conditions. 


METHODS SUMMARY 


Frequencies of codon usage were analysed from all putative proteins in the whole 
genome of Synechococcus elongatus PCC 7942, all 59 ribosomal genes in the 
genome (Supplementary Table 2), and 16 highly expressed genes that show high 
microarray values from both data sets (Supplementary Table 3). The fractions of 
codon usage were based on usage frequencies per 1,000 codons (Supplementary 
Table 1). The overall relative synonymous codon usage (RSCU) values of the 
genome were calculated from 3,261 coding sequences from the genome (Sup- 
plementary Table 8), and infrequently used codons in kaiB or kaiC were changed 
to those coding for the specific amino acids with higher RSCU values (Sup- 
plementary Tables 9 and 10) or higher codon-usage fractions (Supplementary 
Table 1) in the genome. DNA fragments containing optimized kaiB or kaiC coding 
sequences (optKaiB or optKaiC) were synthesized and cloned into a plasmid 
containing the wild-type kai cluster DNA to replace the corresponding wild-type 
DNA sequences. The CAI values of all genes in the genome and the codon- 
optimized versions of the kai genes were calculated based on RSCU values. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Evaluation of codon usage. To evaluate the frequency of codon usage of the 
central clock genes kaiABC, we analysed coding sequences from different groups. 
One group is from 2,400,255 residue sequences from all putative proteins in the 
whole genome of Synechococcus elongatus PCC 7942. The second group is from all 
59 ribosomal genes in the genome (Supplementary Table 2). We also analysed 
microarray data sets from two independent laboratories. First, we calculated 
total microarray signals from one circadian cycle in constant light for each of these 
genes. Then, all of these genes were rearranged from strongest to weakest mRNA 
abundances based on the total microarray signal values in constant light. Finally, 
we selected the top 16 genes that show high microarray values from both data sets 
(Supplementary Table 3), and combined all of these coding sequences to the third 
group as putative highly expressed genes. The codon usage frequency was analysed 
with a web-based program from the Sequence Manipulation Suite of 
Bioinformatics.org (http://www.bioinformatics.org). The fractions of codon usage 
were based on usage frequencies per 1,000 codons (Supplementary Table 1). 
Codon optimization of kai genes. Relative synonymous codon usage (RSCU) is 
defined as the ratio of the observed frequency of codons to the expected frequency, 
given that all the synonymous codons for the same amino acids are used 
equally. In the absence of any codon-usage bias, the RSCU value would be 
1.00. A codon that is used less frequently than expected will have a value of less 
than 1.00, and vice versa for a codon used more frequently than expected. The 
overall RSCU values of S. elongatus PCC 7942 were calculated from 3,261 coding 
sequences (990,021 codons) from the genome (Supplementary Table 8). 
Infrequently used codons in kaiB or kaiC coding sequences were changed to those 
coding for the specific amino acids with higher RSCU values (Supplementary 
Tables 9 and 10) or higher codon-usage fractions (Supplementary Table 1) in 
the genome. As shown in Supplementary Fig. 6 and Supplementary Table 9, 67 
out of 102 codons were optimized for the entire kaiB coding region, whereas 
for the kaiC gene, the infrequently used codons were mainly optimized in the 
amino-terminal KaiC-I domain that is encoded by the 5’ half of the kaiC gene 
(Supplementary Fig. 7 and Supplementary Table 10). 
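
The RSCU definition above can be illustrated with a minimal sketch. It assumes the standard genetic code and in-frame coding sequences; the function name `rscu` and the table-building code are hypothetical and are not the authors' software.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(sequences):
    """Return {codon: RSCU} for a collection of in-frame coding sequences.

    RSCU = observed codon count / expected count, where the expected count
    assumes all synonymous codons of an amino acid are used equally.
    Families never observed in the input are omitted.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            if codon in CODON_TABLE:  # skip codons with ambiguous bases
                counts[codon] += 1

    # Group codons into synonymous families by encoded amino acid.
    families = defaultdict(list)
    for codon, amino_acid in CODON_TABLE.items():
        families[amino_acid].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue
        expected = total / len(codons)
        for c in codons:
            values[c] = counts[c] / expected
    return values
```

For a toy input such as `rscu(["GCTGCTGCTGCC"])`, the four alanine codons share four observations, so the expected count per codon is 1 and a codon seen three times receives a value above 1.00, matching the interpretation given in the text.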

Synthesis and construction of optimized kai genes. DNA fragments containing 
optimized kaiB or kaiC coding sequences with wild-type flanking sequences were 
commercially synthesized and cloned into the SmaI site of pUC57 (EZBiolab) to 
produce pUCoptKaiB or pUCoptKaiC, respectively. On the basis of pUCoptKaiB 
or pUCoptKaiC templates, the optKaiB or optKaiC fragments were resynthesized 
using 12-18 thermal cycles with PfuUltra High-Fidelity DNA polymerase 
(Stratagene) and primers containing the corresponding wild-type flanking 
sequences. After purification, the optKaiB or optKaiC fragments were EZcloned 
(Stratagene) into a plasmid containing the wild-type kai cluster DNA to replace 
the corresponding wild-type DNA sequences. After the unchanged parental plas- 
mid was digested at 37 °C for 1 h with Dpn I (New England Biolabs), the circular, 
nicked optimized double-stranded DNA was transformed into Escherichia coli to 
generate pKai-optKaiB and pKai-optKaiC, respectively. To optimize the kaiB and 
kaiC genes, the resynthesized optKaiB fragment was EZcloned into the pKai- 
optKaiC plasmid to replace the corresponding wild-type kaiB coding sequences 
and produce pKai-optKaiBC. All of these kai-optimized constructs were confirmed 
by DNA sequencing analysis. For direct comparison, the wild-type 
versions of the kaiB, kaiC and kaiBC genes or coding sequences are indicated as 
kaiB^WT, kaiC^WT and kaiBC^WT, whereas the codon-optimized versions are denoted 
as kaiB^opt, kaiC^opt and kaiBC^opt, respectively. 

Computation of CAI and 5’ mRNA folding energy. To calculate CAI values of 
genes, we began by calculating the RSCU of all ribosomal genes (Supplementary 
Table 2). Using these RSCU values, we calculated the CAI values of all genes in the 
S. elongatus genome and the codon-optimized versions of the kai genes. Using a 
sliding window of 20 codons, Fig. 1a shows that optimized versions of both kaiB 
(kaiB^opt) and kaiC (kaiC^opt) genes have higher CAI values along the entire length 
of the genes than the wild-type versions, and higher than the average CAI of 
ribosomal genes. The CAI of the wild-type kaiB and kaiC genes is less than the 
average CAI of the genome, in spite of their high abundance in the transcriptome. 
As expected, the average CAI of ribosomal genes is much higher than the genome average. 
As an example, the CAI of kaiB^opt was optimized to 0.95 as shown by the red line in 
Fig. 1b using the RSCU of ribosomal genes. 
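
A sliding-window CAI of the kind described above might be sketched as follows. This assumes the conventional definition of CAI as the geometric mean of per-codon relative adaptiveness values (each codon's reference RSCU divided by the largest RSCU in its synonymous family). The function names, the input layout and the `MIN_W` floor for unobserved codons are illustrative assumptions, not details taken from the paper.

```python
import math

# Assumed floor so that codons absent from the reference set do not give log(0).
MIN_W = 0.01


def relative_adaptiveness(rscu_by_family):
    """Convert {amino_acid: {codon: RSCU}} from a reference gene set
    (here, ribosomal genes) into {codon: w}, with w = RSCU / max RSCU in family."""
    weights = {}
    for codons in rscu_by_family.values():
        best = max(codons.values())
        for codon, value in codons.items():
            weights[codon] = max(value / best, MIN_W)
    return weights


def cai(codons, weights):
    """Geometric mean of w over the codons that have a weight.

    Codons without a weight (for example stop codons, or single-codon
    amino acids left out of the reference table) are skipped.
    """
    logs = [math.log(weights[c]) for c in codons if c in weights]
    if not logs:
        raise ValueError("no scorable codons in this stretch")
    return math.exp(sum(logs) / len(logs))


def sliding_cai(sequence, weights, window=20):
    """CAI for every window of `window` consecutive codons along a coding sequence."""
    sequence = sequence.upper()
    codons = [sequence[i:i + 3] for i in range(0, len(sequence) - len(sequence) % 3, 3)]
    return [cai(codons[i:i + window], weights)
            for i in range(len(codons) - window + 1)]
```

A window made only of the preferred codon of each family scores 1.0, and any less-preferred codon lowers the score, which is why an optimized gene sits above its wild-type counterpart along its length.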

In addition to codon usage as it relates to the relative expression of various tRNA 
genes, the translational efficiency of the production of a protein depends on the 5’ 
folding energy of its mRNA. To calculate the minimum free energy (ΔG) of folding 
of the 5’ region of mRNAs, we used the first 40 nucleotides of the coding sequences 
(1-40 nucleotides). In addition, we used the RNA folding algorithm in RNAfold of 
the Vienna RNA package 2.0 (ref. 33). We calculated ΔG at 37 °C for each gene in 
the S. elongatus genome using default parameters. Then, to calculate the effect of 
temperature on folding energy for the kai genes (wild-type and optimized ver- 
sions), we varied the temperature from 18 to 40°C. The folding energy of the 
kaiB^WT and kaiC^WT genes is much higher than the rest of the genome, indicating 
selection for faster translation initiation (Fig. 1c). Because the folding energy of an 
mRNA depends on temperature, we calculated the minimum free energy of 
folding (ΔG) at varying temperatures: ΔG for both kaiB^WT and kaiB^opt 
increases with temperature, and optimizing codon usage also increases the ΔG 
of the kaiB^opt gene at all temperatures (Fig. 1d). 

Generation of kai-optimized and kai-co-expressing cyanobacterial strains. S. 
elongatus PCC 7942 was used as the cyanobacterial host strain. A kaiBCp::luxAB 
luminescence reporter of clock-controlled gene expression was integrated into 
either neutral site I (NS I) with a spectinomycin selection marker (or in other 
experiments, into NS II with a chloramphenicol selection marker), in which the 
expression of the Vibrio harveyi luciferase structural gene cassette luxAB is driven 
by the promoter of the kaiBC genes (kaiBCp) and serves as a real-time reporter of 
promoter activity. The kaiABC-null strain was created by replacement of the 
kaiABC DNA region with a kanamycin-resistance gene. The wild-type kaiABC 
cluster or codon-optimized kaiABC cluster containing a kaiB^opt, kaiC^opt or 
kaiBC^opt coding region and a spectinomycin-resistance cassette was reintroduced 
into the endogenous kai locus by replacing the kanamycin-resistance gene of the 
kaiABC-null strain to give rise to transgenic strains with either the wild-type or the 
optimized version of the kaiABC cluster. This replacement was done in such a way 
that the kaiABC cluster was recreated to be exactly the same as the wild-type 
cluster with no insertions or deletions except a selection marker downstream of 
the genomic kaiC sequence. The transgenic cyanobacterial strain containing the 
wild-type version of the kai cluster was named the wild-type strain, whereas the 
strains containing codon-optimized versions of the kai cluster with kaiB^opt, kaiC^opt 
or kaiBC^opt were called optKaiB, optKaiC or optKaiBC, respectively (Fig. 2a). For 
co-expression of wild-type kaiB, kaiC or kaiBC genes in vivo, the trcp::kaiB^WT, 
trcp::kaiC^WT or trcp::kaiBC^WT constructs with a kanamycin-resistance marker 
were transformed into the NS II region of a wild-type luminescence reporter strain 
to generate kai-co-expressing strains KaiB^WT/OX, KaiC^WT/OX or KaiBC^WT/OX 
(Fig. 3c), in which the co-expression of wild-type kaiB, kaiC or kaiBC genes from 
NS II was under the control of an IPTG-derepressible heterologous trc promoter, 
and the original kaiABC cluster remains at the original wild-type site. These 
codon-optimized strains containing kaiB^opt, kaiC^opt or kaiBC^opt and kai-co- 
expressing strains were confirmed by PCR, DNA sequencing, immunoblotting 
and luminescence analyses. As indicated in Fig. 2a, the wild-type coding region 
of the kaiB gene was replaced with kaiB^opt coding sequences in the optKaiB strain, 
whereas the kaiBC coding region was replaced with both the kaiB^opt and kaiC^opt 
(that is, kaiBC^opt) coding sequences in the optKaiBC strain. At NS II, a kaiBC 
promoter-driving luxAB expression cassette with a chloramphenicol selection 
marker was integrated to function as a luminescence reporter of clock-controlled 
promoter activity. In the kaiB^WT- or kaiBC^WT-co-expressing strains in Fig. 3c, 
three sites in the chromosome are depicted: (1) the wild-type kaiABC cluster 
(kaiABC^WT); (2) the luxAB luminescence reporters in NS I; and (3) an IPTG- 
derepressible promoter driving expression of kaiB^WT (KaiB^WT/OX strain) or 
kaiBC^WT (KaiBC^WT/OX strain) in NS II. 

Measurement of luminescence rhythms. Cyanobacterial strains were grown in 
modified BG11 (ref. 34) liquid media with air bubbling or BG11 agar plates 
supplemented with appropriate antibiotics (20 µg ml⁻¹ spectinomycin; 10 µg 
ml⁻¹ kanamycin; 7.5 µg ml⁻¹ chloramphenicol) at 30 °C under continuous 
cool-white illumination (constant light; 40-50 µE m⁻² s⁻¹). Before the cells were 
released into constant light for the luminescence assay, a 12-h dark exposure was 
given to synchronize the rhythms of the individual cells in the population. For 
induction of co-expressed kaiB, kaiC or kaiBC genes, the trcp inducer IPTG 
was added at the beginning of constant light to final concentrations of 0, 2, 5, 
10, 100 or 1,000 µM. Luminescence was measured with the kaiBCp::luxAB 
reporter of clock-controlled gene expression that serves as a real-time reporter 
of promoter activity. For measurement of in vivo luminescence rhythms at 
different temperatures (18-38 °C), at least 12 independent colonies for each condition 
were monitored as previously described. Our standard light intensity 
for the luminescence rhythm assay was 40-50 µE m⁻² s⁻¹, but we observed that 
the rate of damping in the wild type was influenced by both the temperature 
(as described in this paper) and light intensity (as tested under a range of light 
intensities). 

Calculation of free-running period, Q10 value, and damping rate of luminescence 
rhythms. The period of luminescence rhythms was analysed with 
ChronoAnalysis II, version 10.1 (courtesy of T. Roenneberg), and the Q10 value 
for evaluation of temperature compensation over a wide range of temperatures was 
calculated with the following equation: $Q_{10} = [(1/t_2)/(1/t_1)]^{10/(T_2 - T_1)}$, in which 
$t_1$ denotes period at the lower temperature ($T_1$), and $t_2$ denotes period at the 
higher temperature ($T_2$). Damping rate is the number of days required for 
the amplitude of the rhythm to decrease to 1/e of the starting value. The damping 
rates were calculated with the LUMICYCLE data analysis program (Actimetrics; 
courtesy of D. Ferster). The program fits the data to a sine wave multiplied by an 
exponential decay factor. 
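
As a worked illustration of the Q10 equation in this section, the following minimal sketch computes Q10 from two free-running periods measured at two temperatures. The function and parameter names are hypothetical; only the formula itself comes from the text.

```python
def q10(period_low, period_high, temp_low, temp_high):
    """Q10 = [(1/t2)/(1/t1)] ** (10 / (T2 - T1)).

    period_low  : period t1 (h) measured at the lower temperature T1
    period_high : period t2 (h) measured at the higher temperature T2
    temp_low, temp_high : T1 and T2 in degrees Celsius
    """
    if temp_high <= temp_low:
        raise ValueError("temp_high must be greater than temp_low")
    if period_low <= 0 or period_high <= 0:
        raise ValueError("periods must be positive")
    # Ratio of rhythm frequencies (1/period) at the two temperatures.
    rate_ratio = (1.0 / period_high) / (1.0 / period_low)
    return rate_ratio ** (10.0 / (temp_high - temp_low))
```

A perfectly temperature-compensated rhythm has the same period at both temperatures and therefore a Q10 of 1, whereas a period that halves over a 10 °C increase gives a Q10 of 2.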

Immunoblot assays for Kai abundance. After two 12-h light-dark cycles, liquid 
cultures at D750 nm = 0.3 were released to constant light at indicated temperatures. 
About 30 ml of cells were collected at different time points and an appropriate 
amount of fresh medium was added to the culture flask to maintain an equal cell 
density during the time-course experiment. For kaiB- or kaiBC-co-expressing 
strains, the cultures were treated with or without 5 µM IPTG at constant light time 
zero, for 12 h before cell collection. Total proteins were extracted as previously 
described. Total extracts were separated by SDS-PAGE (15% gel for KaiB and 
10% gel for KaiA and KaiC) and transferred onto nitrocellulose membranes. 
The membranes were immunoblotted using polyclonal 
rabbit antisera raised against KaiA or KaiB or using polyclonal mouse antisera 
raised against KaiC. Equal loading of extracts was confirmed by Coomassie blue 
staining in the gel, Ponceau red staining on the membrane, and/or by the density of 
nonspecific bands on the immunoblots. The immunoblot signals for relative Kai 
protein density/abundance were analysed with National Institutes of Health 
ImageJ software. 

Determination of growth rate and doubling time. Growth rates of cyanobac- 
terial strains, including wild type, optKaiBC, as well as two arhythmic/damped 
kaiC mutants (CLAb and CLAc), were measured in parallel at 18, 20, 25, 30, 34 
and 37 °C. Initial cultures were grown in liquid BG-11 medium at 30 °C under 
constant illumination (50 µE m⁻² s⁻¹) in a shaking water bath at 100 r.p.m. and 
with air bubbling into the cultures. Cell densities were monitored by measuring the 
attenuance at 750 nm (D750 nm). When cell densities reached D750 nm = 0.8, cultures 
were diluted to D750 nm = 0.005, and grown in 12-h light-dark cycles (12 h 
light (50 µE m⁻² s⁻¹) followed by 12 h darkness) in water baths set to 18, 20, 25, 30, 
34 or 37 °C with shaking (100 r.p.m.) and air bubbling. Cell densities were 
determined at D750 nm over a time course as indicated. When D750 nm values of 
cell cultures exceed 0.9, the attenuance measurement is not linear with cell density. 
Therefore, samples with a D750 nm larger than 0.9 were 
diluted to a D750 nm within the linear range before attenuance determination 
(and the plotted D value is then corrected for the dilution). Two to six 
independent experiments were performed for each strain, and the growth curves 
were plotted as average D750 nm values over time in light-dark. Doubling time was 
calculated by fitting exponential curves to the growth curves. Growth rate was 
obtained by fitting growth data to exponential curves (cell density at a specific 
time = initial cell density × e^(growth rate × time)) (see 
http://mathworld.wolfram.com/LeastSquaresFittingExponential.html). Doubling time was calculated as: 
doubling time (h) = (ln(2)/growth rate) × 24. 
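
The growth-rate fit and doubling-time conversion can be sketched as below. This is a minimal illustration that assumes time is expressed in days (implied by the factor of 24 in the formula) and uses an ordinary least-squares fit of ln(density) against time, which is one of the exponential fitting forms described at the cited MathWorld page. The function names are hypothetical and this is not the authors' script.

```python
import math


def fit_growth_rate(times_days, densities):
    """Least-squares fit of ln(D) = ln(D0) + rate * t.

    Returns (rate per day, fitted initial density D0).
    """
    if len(times_days) != len(densities) or len(times_days) < 2:
        raise ValueError("need at least two paired (time, density) points")
    if any(d <= 0 for d in densities):
        raise ValueError("densities must be positive to take logarithms")

    n = len(times_days)
    logs = [math.log(d) for d in densities]
    mean_t = sum(times_days) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_days)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_days, logs))

    rate = sxy / sxx                      # slope = growth rate per day
    d0 = math.exp(mean_y - rate * mean_t)  # intercept back-transformed
    return rate, d0


def doubling_time_hours(rate_per_day):
    """doubling time (h) = (ln(2) / growth rate) x 24, with rate per day."""
    if rate_per_day <= 0:
        raise ValueError("growth rate must be positive")
    return math.log(2) / rate_per_day * 24.0
```

For a culture whose density doubles once per day, the fitted rate equals ln(2) per day and the doubling time evaluates to 24 h, which is a convenient sanity check on the units.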

Statistical analyses. A two-tailed Student’s t-test was used for statistical analyses. 
*P<0.05; **P<0.01. 
CPEB1 coordinates alternative 3’-UTR formation 
with translational regulation 


Felice-Alessio Bava, Carolina Eliscovich, Pedro G. Ferreira, Belén Miñana, Claudia Ben-Dov, Roderic Guigó, 
Juan Valcárcel & Raúl Méndez 


More than half of mammalian genes generate multiple messenger 
RNA isoforms that differ in their 3’ untranslated regions (3’ UTRs) 
and therefore in regulatory sequences, often associated with cell 
proliferation and cancer; however, the mechanisms coordinating 
alternative 3'-UTR processing for specific mRNA populations 
remain poorly defined. Here we report that the cytoplasmic poly- 
adenylation element binding protein 1 (CPEB1), an RNA-binding 
protein that regulates mRNA translation, also controls alternative 
3’-UTR processing. CPEB1 shuttles to the nucleus, where it co-localizes 
with splicing factors and mediates shortening of hundreds 
of mRNA 3’ UTRs, thereby modulating their translation efficiency 
in the cytoplasm. CPEB1-mediated 3'-UTR shortening correlates 
with cell proliferation and tumorigenesis. CPEB1 binding to pre- 
mRNAs not only directs the use of alternative polyadenylation 
sites, but also changes alternative splicing by preventing U2AF65 
recruitment. Our results reveal a novel function of CPEB1 in medi- 
ating alternative 3’-UTR processing, which is coordinated with 
regulation of mRNA translation, through its dual nuclear and 
cytoplasmic functions. 

Alternative 3'-UTR processing to produce mRNA isoforms can 
occur by either alternative splicing of terminal exons and/or alternative 
polyadenylation (APA). APA, which is significantly more frequent 
than alternative splicing, usually occurs by activation of ‘weaker’ polyadenylation 
signals (PAS) located 5’ of ‘stronger’ PAS, the latter tending 
to be used by default. In general, longer 3’ UTRs display reduced 
translation, for example, due to microRNA target sites present in 
longer transcripts but not in shorter variants. Transcripts encoding 
proto-oncogenes and other cell proliferation proteins tend to have 
longer 3’ UTRs in non-proliferative tissues and shorter variants in 
transformed cells or highly proliferative tissues. 

Nuclear polyadenylation requires two core sequence elements: the 
canonical PAS, recognized by the cleavage and polyadenylation 
specificity factor (CPSF), and a G/U-rich element, bound by the 
cleavage-stimulating factor. Other upstream accessory sequence ele- 
ments modulate the use of suboptimal or non-consensus sites. Specific 
cytoplasmic mRNAs undergo elongation of their poly(A) tails in a 
cleavage-independent manner. Cytoplasmic polyadenylation uses the 
same PAS as in nuclear polyadenylation as well as an upstream cytoplasmic 
polyadenylation element (CPE). Cytoplasmic polyadenylation 
is mediated by a CPE-binding protein (CPEB1), which recruits 
CPSF to the PAS; they then jointly recruit the cytoplasmic poly(A) 
polymerase GLD2. CPE-mediated translational control targets 
10-20% of the genome, particularly transcripts encoding cell cycle 
and cell differentiation regulators. CPEB1 complexes are, at least 
in part, preassembled in the nucleus before their transport to the 
cytoplasm. Although the nucleo-cytoplasmic distribution of CPEB1 
varies (Supplementary Fig. 1a), in the stage IIIb Hodgkin’s lymphoma-derived 
cell line (HD-MyZ) CPEB1 is mainly nuclear and 
largely co-localizes with the splicing factor SFRS2; this suggests a 
potential functional link between nuclear CPEB1 and pre-mRNA processing 
(Fig. 1a). The specificity of CPEB1 signal was confirmed by 
transfection with an inducible CPEB1 short hairpin (sh)RNA, which 
resulted in a strong reduction of CPEB1 protein levels and of the nuclear 
fluorescent signal (Supplementary Fig. 1a, b). We found that this nuc- 
lear CPEB1 interacts with CPSF by cross-coimmunoprecipitation in 
RNase-treated and CPEB1-supplemented HeLa nuclear extracts (Sup- 
plementary Fig. 1c). 

To address whether CPEB1 could regulate alternative splicing, we 
compared transcripts from control and CPEB1-knockdown (CPEB1-KD) 
HD-MyZ cells using a splicing-sensitive microarray covering 
>1,800 alternative splicing events in almost 500 genes important for 
cell proliferation, cancer progression and pre-mRNA splicing, including 
113 events affecting 3’ terminal exons. We identified 139 
CPEB1-dependent changes affecting 81 genes (Supplementary Table 
1). Of those, 17 events were in 3’ terminal exons for 6 genes. In five of 
the cases, a tandem CPE/alternative PAS was found in the proximity of 
an alternative 3’ splice site that becomes more used after CPEB1 
knockdown. In others, CPEB1-enhanced cleavage eliminated the most 
distal alternative 3’ splice site and in the last the CPEs/alternative PAS 
were upstream of an alternative 5’ splice site that is more used upon 
CPEB1 knockdown (Supplementary Fig. 2a). The changes in alterna- 
tive splicing were confirmed by reverse transcription followed by 
quantitative PCR (RT-qPCR) using splice junction-specific oligonu- 
cleotides (Supplementary Fig. 2b). 

3'-UTR processing of BUB3 mRNA results in five possible alterna- 
tive 3’ UTRs generated from a combination of two alternative 3’ splice 
sites and four alternative PAS (Fig. 1b). Expressed-sequence-tag evi- 
dence supports the tissue-specific expression of these variants, and 
BUB3 mRNA has been proposed to be translationally regulated by 
CPEB1. Indeed, BUB3 pre-mRNA contains two CPEs immediately 
downstream of the first 3’ splice site and PAS. We determined the 
BUB3 3'-UTR variants generated in the presence or absence of CPEB1 
by RT-PCR, with oligonucleotide primers for the constitutive exon 7 
and the poly(A) tail, followed by Southern blot for the common exon 7. 
To identify each of the 3’-UTR variants, the blots were re-hybridized 
with junction-probes complementary to the two possible splice sites 
(3’ SS1 and 3’ SS2 in Fig. 1c) and with probes complementary to 
sequences just upstream of the four possible cleavage sites (PAS1 to 
PAS4 in Fig. 1c). Quantification of the different UTRs labelled with 
exon 7 (Fig. 1d; see Methods for the quantification procedure) con- 
firmed a small but reproducible increase in the use of 3’ SS1 relative to 
3’ SS2 on CPEB1 knockdown (Fig. 1b, lanes 1 and 2 for PAS and 3 to 6 
for alternative splicing). Independent quantification using splicing 
specific RT-qPCR (Fig. 1d and Supplementary Fig. 3a) showed a sim- 
ilar increase in the 3'-SS1/3’-SS2 ratio on CPEB1 knockdown. In 
addition, the quantification of each variant labelled with the 3’ SS1 
Figure 1 | Nuclear CPEB1 co-localizes with SFRS2 and regulates the 
alternative 3’-UTR formation of BUB3 mRNA. a, Immunofluorescence of 
CPEB1 and SFRS2 in HD-MyZ cells. DNA staining (DAPI) in blue, CPEB1 
staining in green, SFRS2 staining in red and the merged signals are shown. Scale 
bar, 10 µm. b, Schematic representation of BUB3 3’ UTR. Two alternative 3’ 
splice sites (3’ SS1 and 3’ SS2), four PAS (PAS1, 2, 3, 4) and two CPEs (C) are 
indicated. The asterisks indicate stop codons. c, HD-MyZ cells were induced 
(CPEB1-sh+) or not (CPEB1-sh—) with doxycycline. Total RNA was isolated 
and 3’ UTRs from endogenous BUB3 mRNA were amplified by 3’ RACE. The 
cDNA products were analysed by Southern blot using the indicated ³²P-labelled 
probes. 3’-SS1-derived 3’ UTRs (solid arrowheads; numbers indicate PAS1, 
PAS2 or PAS3) and 3’ SS2/PAS4 3’ UTR (white arrowheads) are indicated. The 
asterisks indicate unspecific products. d, Quantifications of APA or alternative 
splicing from c (black bars, see Methods) or by RT-qPCR (white bars). Error 
bars, s.d.; n > 3. (P = 0.00016 for alternative splicing endogenous BUB3 mRNA 
in control versus KD cells). e, WT or CPE-mutated BUB3 pre-mRNAs 
coimmunoprecipitated with CPEB1 were detected by intron-specific RT- 
qPCR. Error bars, s.d.; n = 6. 


or PAS1 probes unveiled a significant increase in the use of the PAS2 
and PAS3 relative to PAS1 in absence of CPEB1 (Fig. 1d). CPEB1 
depletion also had an effect on total BUB3 mRNA levels, but not 
in other ‘housekeeping’ transcripts (Supplementary Fig. 3b). These 
results, confirmed with two other shRNAs (Supplementary Fig. 4a, b), 
indicate that CPEB1 on one hand promotes use of the 3’ SS2/PAS4 over 
UTRs using 3’ SS1, and on the other hand promotes 3’ SS1/PAS1 at 
the expense of the 3’ SS1/PAS2 and 3’ SS1/PAS3 variants. 

To determine if these effects were direct, we transfected a minigene 
containing a tagged-exon 7, intron 7 and exon 8a/b and the corres- 
ponding variants with inactivating mutations either in the CPEs 
(Fig. 2a) or in alternative PAS1 and PAS2 (Supplementary Fig. 5). 
Accordingly, the unprocessed pre-mRNAs generated by the wild type, 
but not by the CPE mutant-minigene, coimmunoprecipitated with 
CPEB1 (Fig. 1e and Supplementary Fig. 6). We then analysed the 
3'-UTR variants generated from the minigenes in control or CPEB1- 
KD cells. Transcripts from the minigene were processed as the endo- 
genous BUB3 transcript and responded equally to CPEB1 depletion 
(Figs 1c and 2a). Quantification of the alternative 3’ UTRs, generated 
by the wild-type and mutant minigenes and in the presence or absence 
of CPEB1 (Fig. 2b), showed that either inactivation of the CPEs or 
knockdown of CPEB1 changed the selection of the PAS from PAS1 to 
a distributive use of all three PAS1-3 sites (Fig. 2a, b). Interestingly, 
the effects of knocking down CPEB1 or mutating the CPEs on alter- 
native PAS selection were similar and non-additive (Fig. 2b, left 
panel), suggesting that the selection of PAS1 is directly driven by 
CPEB1 binding to the CPEs. For the selection of the 3’ splice acceptor 
site, mutation of the CPEs or CPEB1 knockdown caused a shift 
from the use of 3’ SS2 to 3’ SS1 (Fig. 2a, b). Independent quantification 
using splice junction-specific RT-qPCR showed a similar increase in 
the 3’ SS1/3’ SS2 ratio on CPEB1 knockdown or CPE mutations 
(Fig. 2b). 

For alternative splicing the effect of CPE mutation was partially 
additive to the effect of CPEB1 knockdown, suggesting that CPEB1 
has both direct and indirect effects on BUB3 alternative splicing. The 
observed changes were not due to mRNA stability differences (Sup- 
plementary Fig. 7a) or to nonsense-mediated decay (Supplementary 
Fig. 7b). Mutation of alternative PAS1-2 increased the use of 3’ SS2 
over 3’ SS1 (Supplementary Fig. 5), consistent with competition 
between cleavage and downstream splicing, as the use of PAS1 and 
PAS2 will remove 3’ SS2. Additionally, mutation of the CPEs or 
knockdown of CPEB1 resulted in the generation of an additional 
splice variant (S3/PAS4) from a cryptic 5’ splice site (Fig. 2a and Sup- 
plementary Fig. 5a, splice junction probe (S3)). To circumvent the 
complexity generated by the combination of APA and alternative 
splicing, we generated a simplified construct containing only the 
three tandem alternative PAS and the CPEs, which was then transfected 
in control or CPEB1-KD cells (Fig. 2c). This simplified processing 
confirmed that CPEB1 promoted the use of PAS1 over PAS2-3, 
independently of splicing. Finally, we used HeLa nuclear extracts 
without or supplemented with recombinant CPEB1 to assess in vitro 
cleavage. Indeed, cleavage was both CPE- and CPEB1-dependent, 
indicating a direct effect of CPEB binding (Fig. 2d and Supplemen- 
tary Figs 8 and 9). 

To test the mechanism by which CPEB1 regulates splicing, we 
performed ultraviolet crosslinking-immunoprecipitation for the U2 
small nuclear ribonucleoprotein particle auxiliary factor 65 kDa 
(U2AF65) (Fig. 3a, b and Supplementary Fig. 10), to test its specific 
binding to the polypyrimidine tract (Supplementary Fig. 10d). 
U2AF65 showed a stronger crosslinking to the distal 3'-SS2 site than 
to the proximal 3’-SS1 site, even without CPEs. When the 3’-SS1 
probe was extended to include the CPEs and PAS1, crosslinking of 
U2AF65 was undetectable, suggesting that CPEs negatively affect the 
recruitment of U2AF65 to the 3’ SS1. Accordingly, inactivating mutation 
of the CPEs (−CPEs in Fig. 3a, b) in the extended probe reduced 
CPEB1 crosslinking and restored U2AF65 crosslinking. Mutation of 
PAS1 in the extended probe (−PAS1 in Fig. 3a, b) also increased 
U2AF65 crosslinking, suggesting that CPEB1-mediated recruitment 
of the polyadenylation machinery to PAS1 negatively affects the 
recruitment of the splicing machinery to 3’ SS1. Interestingly, increasing 
the distance between 3’ SS1 and the CPEs/PAS1 (Spacer in 
Fig. 3a, b) restored U2AF65 crosslinking to the same levels as mutation 
of PAS1, pointing to steric hindrance between the splicing and polyadenylation 
machineries. Taken together, these results indicate that 
CPEB1 negatively affects recognition of the proximal 3’ SS1 by splicing 
factors through the recruitment of the polyadenylation machinery 
to PAS1. However, CPEB1 also had additional ‘indirect’ functions 
in splicing by modulating the levels of several splicing factors (Supplementary 
Fig. 11) encoded by CPE-regulated mRNAs, a known 
mechanism of alternative splicing regulation. 

We next explored the biological relevance of alternative 3'-UTR 
processing of BUB3 pre-mRNA. First, we analysed whether different 
3' UTRs could mediate different translational efficiencies. We generated 
chimaeric transcripts with the Renilla luciferase open reading frame 
(ORF) fused to each of the five 3’ UTRs identified and co-expressed 
Figure 2 | CPEs and CPEB1 directly regulate the alternative 3’-UTR 
formation of BUB3 mRNA. a, HD-MyZ cells, stably transfected with a 
doxycycline-inducible CPEB1 shRNA vector, were treated (CPEB1-sh+) or not 
(CPEB1-sh—) with doxycycline and then transfected with BUB3 minigene 
containing WT or mutated CPEs. The minigene-derived 3’ UTRs and the 
negative control (RT −) were analysed and labelled as in Fig. 1c. S3/PAS4 3’ UTR 
(white diamond) is indicated. Asterisks indicate unspecific products. 
b, Quantifications of APA or alternative splicing from c (black bars, see Methods) 
or by RT-qPCR (white bars). Error bars, s.d.; n > 3. (For alternative splicing; 
P = 0.00037 for WT vs (−CPEs) minigene in control cells and P = 0.00030 for 


them with Firefly luciferase as a control. To avoid nuclear processing of 
the reporters (Supplementary Fig. 12), any PAS upstream from the one 
being tested were inactivated. After normalizing for transcript levels 
(see Supplementary Fig. 12 for unnormalized luciferase activities and 
mRNA levels), all three variants generated from the 3’ SS1 (PAS1, 
PAS2 and PAS3) and S3/PAS4 showed less efficient translation than 
the 3'-SS2 variant (Fig. 3c). Thus, the BUB3 transcript variants generated 
in the presence of CPEB1 resulted in higher translation. Indeed, the 
level of endogenous BUB3 protein synthesized in the presence of 
CPEB1 was significantly higher than in CPEB1-KD cells (Fig. 3d), 
even though the level of BUB3 mRNA increased in the knockdown 
(Supplementary Fig. 3b). BUB3 is an essential component of the 
spindle-assembly checkpoint; accordingly, CPEB1-KD cells failed to activate 
the spindle-assembly checkpoint (Supplementary Fig. 13). 

Next, we examined how general the nuclear function of CPEB1 is in 
alternative pre-mRNA processing. We purified RNA from control and 
CPEB1-KD HD-MyZ cells and analysed 3’ UTRs by high-throughput 
sequencing of the poly(A)-containing RNA fragments, identifying 
CPEB1-dependent APA changes (Fig. 3g and Supplementary 
Fig. 14). For 4,442 3’ UTRs (out of 13,039; see Supplementary 
Information), we could estimate the transcript end. Control cells 




WT minigene in control versus KD cells). c, Schematic representation of a
simplified BUB3 3’ UTR. PAS1, 2, 3 and two CPEs (C) are indicated (left panel). 
HD-MyZ cells were induced with doxycycline as in a, the simplified minigene 
was transfected and the derived 3’ UTRs characterized as in a (middle panel). 
Quantifications of APA are plotted (right panel). Error bars, s.d.; n > 3. d, HeLa 
nuclear extract (nucl. extr.), supplemented or not with recombinant CPEB1, was
incubated with the indicated radiolabelled RNAs transcribed from the minigene
shown in c, either WT or with mutated CPEs. Then, RNA was extracted and
resolved in a denaturing acrylamide gel, together with the free probes as controls. 
The cleaved product (PAS1) and the uncleaved product are indicated. 


showed significantly shorter 3’ UTRs than CPEB1-KD cells,
but only for RNAs containing a CPE near the proximal alternative PAS 
(Supplementary Figs 6a, 11 and Supplementary Tables 2, 3 and 4). 868 
UTRs were shorter in control cells, as opposed to 16 cases that were 
shorter in the knockdown. Of the former, 61% had a CPE motif near 
the APA site (P<10 ‘°, Supplementary Table 4). Thus, we observed 
a correlation between the presence of CPEs in the proximity of an 
alternative PAS and the use of this site when CPEB1 was present, 
but not in its absence, to produce shorter 3’ UTRs (Supplementary 
Figs 15-19). This was also the case for the BUB3 3’ UTR, for which the
reads confirmed the results obtained by Southern blot and RT-qPCR 
experiments (Fig. 3h). For a few randomly selected candidates, we 
performed 3’ RACE (rapid amplification of cDNA ends) RT-PCR 
followed by Southern blot in HD-MyZ cells expressing or not the 
CPEB1 shRNA. In all three cases, we observed an increase in the long 
versus short 3’-UTR ratio on depletion of CPEB1 (Fig. 3e). Moreover, 
the change in 3'-UTR length had a direct effect on the final accumula- 
tion of the encoded proteins (Fig. 3f). As is the case for cytoplasmic 
polyadenylation”, gene ontology analysis revealed that transcripts 
with CPEB1-regulated APA are enriched in cell cycle/growth/adhe- 
sion proteins and oncogenes (Supplementary Table 5). We performed 
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Figure 3 | CPEB1 regulates recruitment of U2AF65, the use of proximal 
polyadenylation sites and BUB3 mRNA translational control. a, Schematic 
representation of radiolabelled RNA probes derived from the BUB3 3’ UTR. b, The
indicated radiolabelled RNA probes were incubated with HeLa nuclear extract, 
ultraviolet-crosslinked and digested with RNase. U2AF65 or CPEB1 were 
immunoprecipitated and detected by autoradiography after 10% SDS-PAGE. 
c, HD-MyZ cells were transfected with vectors containing the Renilla luciferase 
ORF fused to the indicated BUB3 3’ UTRs. Firefly luciferase ORF was used as a 
control. The relative levels of Renilla to Firefly activities, normalized by their 
mRNA levels, are shown (R/F). Error bars, s.d. d, HD-MyZ cells, as in Fig. 2,


RNA-immunoprecipitation for CPEB1, but the large fraction of CPEB1- 
bound mature mRNAs masked the association with pre-mRNAs (see 
Supplementary Methods). 

We found a general, but tissue-specific, trend towards shorter 3’ 
UTRs in CPE-containing mRNAs (Supplementary Fig. 20; Sup-
plementary Tables 6 and 7). Thus, in poorly differentiated tissues or in
tumour samples, the correlation between shorter 3’ UTRs and the
presence of CPEs was highly significant, whereas differentiated tissues
expressed the longest possible 3’ UTR. This differential effect is con-
sistent with the distribution of cis-acting elements in their 3’ UTRs
(Supplementary Figs 21 and 22; Supplementary Table 3). Finally, we
performed a ‘naive’ motif enrichment analysis in the regions sur- 
rounding the polyadenylation sites in control and CPEB1-KD cells 
(Supplementary Tables 12-15). The pair PAS/CPE is the most signifi- 
cantly enriched pair of motifs linked to CPEB1-mediated APA events. 

These results support a general model in which nuclear CPEB1 
promotes alternative 3’-UTR formation by a dual mechanism (Fig. 4). 
First, CPEB1 helps to recruit CPSF to upstream suboptimal PAS that 
have CPEs closer than 100-150 nucleotides, either upstream or down- 
stream. In the absence of CPEB1, these transcripts are processed in the 
strong and more 3’ PAS, thus generating a long 3’ UTR with greater 
potential for containing regulatory elements. When CPEB1 is present,
it promotes the use of an upstream APA site, thus generating shorter 3’ 
UTRs that exclude the regulatory elements. In addition, when the CPEs 
were treated (CPEB1-sh+) or not (CPEB1-sh—) with doxycycline and the 
levels of BUB3, CPEB1 and α-tubulin were analysed by Western blot. Total
RNA was purified and analysed by 3’-RACE-Southern-blot using specific 
probes for the indicated genes (e) or by deep sequencing. g, The distribution of 
relative 3’-UTR length for KD and control, with and without occurrence of 
CPE motif, is plotted (see Methods). f, Extracts from e were analysed by 
Western blot, with α-tubulin as control. h, mRNA sequencing coverage for
BUB3 (NM_004725 and NM_001007793). The read coverage densities on a log 
scale for KD (upper) and WT (lower) are indicated. The PAS and 3’ SS are 
indicated. 


are upstream of the cleavage site (Supplementary Table 8), the resulting 
transcripts include a CPE at an optimal distance from the PAS for their
regulation by CPEB1 in the cytoplasm™. Second, when the CPE-driven
alternative PAS is present in the proximity of an alternative splice 
site, the CPEB1-CPSF complex interferes with the recruitment of 
the splicing machinery. Alternative 3’-UTR formation is mediated 
by a combination of these two mechanisms, APA and alternative 
splicing, including cases where the kinetics of cleavage eliminate 
downstream competing splicing sites, exons or polyadenylation sites. 
Conversely, kinetic competition can favour splice site selection and 
thus eliminate alternative polyadenylation sites. In addition to these 
effects in cis, CPEB1 indirectly modulates alternative splicing by 
changing the relative levels of splicing factors, possibly by regulating 
the translation and/or the alternative 3’ UTRs of their mRNAs. 
Thus, the function of CPEB1 is to recruit CPSF to a PAS, thereby 
promoting cleavage and polyadenylation in alternative PAS in the 
nucleus and poly(A) tail elongation and translational regulation in 
the cytoplasm. This dual function of CPEB1 provides transcript- 
specific coordination of pre-mRNA processing and translation control 
at two levels. First, the cytoplasmic polyadenylation complex would 
be preassembled during CPEB1-mediated pre-mRNA processing. 
Second, the same CPE that promotes the use of the alternative PAS 
will be then placed at the appropriate distance to promote translational 
activation in the cytoplasm. In the longer 3’-UTR variants, generated 
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Figure 4 | Model for CPEB1-mediated regulation of the alternative 3’-UTR 
formation. Schematic representation of alternative polyadenylation (top) and 
alternative splicing (bottom) in the absence (left) and presence (right) of
CPEB1. Abbreviations: C, CPE; wPAS, weak PAS; sPAS, strong PAS; RE, 
regulatory element; CV1, cleavage 1; CV2, cleavage 2; Nuc, nucleus; Cyt, 
cytoplasm; Py, polypyrimidine tract; SS1, 3’ splicing site 1; SS2, 3’ splicing site 2; 
CPSF, cleavage and polyadenylation specificity factor; CPEB, cytoplasmic 
polyadenylation element binding protein 1; and U2AF65, U2 small nuclear 
ribonucleoprotein auxiliary factor. 


in the absence of CPEB1 from the more 3’ consensus PAS, the result- 
ing CPEs would be too far from the PAS to assemble the cytoplasmic 
polyadenylation complex”. This dual function of CPEB1—as a trans- 
lational regulator and as a pre-mRNA processing factor—provides a 
platform from which to coordinate nuclear and cytoplasmic gene 
expression regulation. 


METHODS SUMMARY 


CPEB1-KD cell lines were generated as described’’. For 3’ RACE and Southern 
blot analysis, total RNA was extracted, retro-transcribed using a 3'-RACE oligo 
and amplified using specific primers. After electrophoresis, the agarose gel was 
treated with a denaturation solution and then with a neutralization solution. The
DNA was transferred and cross-linked to a nylon membrane, which was then pre-
hybridized with Church buffer and hybridized with each ³²P-labelled probe. For
luciferase-reporter assays, HD-MyZ cells were transfected with each BUB3 3' UTR 
fused downstream of Renilla luciferase ORF, together with the control plasmid 
(Firefly luciferase). Luciferase activity was detected and normalized to RNA 
content. For ultraviolet cross-linking and immunoprecipitation, ³²P-labelled RNAs
were incubated with HeLa nuclear extract under in vitro splicing conditions, 
ultraviolet-crosslinked and digested with RNase A. U2AF65 or CPEB1 was immu- 
noprecipitated and loaded on 10% SDS-polyacrylamide gels. Gels were dried and 
exposed to a PhosphorImager screen. For mRNA sequencing, DNase-treated total 
RNA was fragmented and poly(A)-positive fragments were reverse-transcribed to 
construct the libraries. The samples were then analysed on a Genome Analyzer IIx. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Antibodies. Anti-CPEB1 antibody was from Proteintech (13274-1-AP). Anti- 
U2AF65 was an MC3 monoclonal antibody hybridoma supernatant”. Anti-BUB3 
was from Abcam (ab-4180). Anti-SFRS2 was from Sigma (S4045). Anti-FN1 was
from Abcam (ab-2413). Anti-CDYL was from Abcam (ab-3999). Anti-SNRPB2 was 
from Proteintech (13512-1-AP). Anti-SF3a-60, anti-SF3a-66, anti-SF3a-120 and 
anti-SF3b-155 were kind gifts from A. Kramer. Anti-U1-70Kda was from Santa 
Cruz (sc-9569). Anti-CPSF100 was from Abcam (Ab126760). Anti-CPSF160 was 
from Abcam (Ab81552). Anti-UPF1 was from Abcam (Ab10510). Anti-α-tubulin
was from Sigma (T902-6). 

Oligonucleotides. For 3’ RACE: 5'-TAATACGACTCACTATAGGGCGG
ATCCTTTTTTTTTTTTTTTTTTTTTTTTTTVN-3’. For BUB3 pre-mRNA RT
after RIP: 5'-TATCGAGCAGGCATCAACAA-3’.

For PCR: sense primers: BUB3, 5'-ATTTGCCACAGGTGGTTCTG-3’ (endo-
genous) or 5’-GTCGACGACACTTGCTCAAC-3’ (minigene); FN1, 5’-GACA
GAGAAGATTCCCGAGAGTA-3’; CDYL, 5’-TGCAGAGGAAGATCGATG
AG-3’; SNRPB, 5’-CAAGAAATAACATTTGGGATAGTCG-3’. Antisense pri-
mer: 5’-GTAATACGACTCACTATAGGGC-3’.

For Southern-blot probes: BUB3 Exon7, 5'-ATGATGGGACTACGCTTGCA-3’;
BUB3 PAS1, 5'-TGGGGAGTACGAATTGTTTT-3’; BUB3 PAS2, 5’-GCA
CTTAAGAAGGTGTTTATAATT-3’; BUB3 PAS3, 5’-GAAACAAAAAATC
ATCAAGGTC-3’; BUB3 PAS4, 5’-CAAGGGTTCTGCATCTCG-3’; BUB3
3’SS1, 5'-GTACATGGTGACTTGGGTTTTG-3’; BUB3 3’SS2, 5’-GATTAGG
TGGACTTGGGTTTTG-3’; BUB3 S3, 5’-GATTAGGTGGACCTTTAATG
CAG-3’; FN1, 5’-TCCAATCCAGAGGAACAAGC-3’; CDYL, 5’-TTCCCCTGA
TCCATTCTCAC-3’; SNRPB2, 5'-CAGTGTTTGTTTTGATAACATTTGG-3’.
For RT-qPCR: FMR1ex16 gene expression 5'-CCCGAACAGATAATCGTCCA-
3’ (sense) and 5'-TGAAGGGATCCATCTGTTGTT-3’ (antisense); FMR1ex16 
inclusion 5’-AAGAAGACATGATAGGATTGTGAGTTT-3’ (sense) and 5’-CT 
TTAACTGGAACACACTGGTCA-3’ (antisense); FMR1ex16 Skipping 5’-CAGA 
TGGATCCCTTCAGATCA-3’ (sense) and 5‘-CATTCACGAGTGGTTGCTGA- 
3’ (antisense); MKNK2 gene expression 5’-CCCCGGAGAACACCTTG-3’ (sense) 
and 5’-CTGCAGGACCATGGGAGT-3’ (antisense); MKNK2 inclusion 5’-AA 
CAGCTGTGCCAAAGACCT-3’ (sense) and 5’-ACTGGGAGGGTGGAGA
CAG-3’ (antisense); MKNK2 skipping 5’-AGAACACCTTGCCCACTCC-3’
(sense) and 5'-GGAAGTGACTGTCCCACCTC-3’ (antisense); SNRPB gene 
expression 5’-CCACCTCCTGGTATGAGACCT-3’ (sense) and 5’-CATTGGA 
GTCCCTCTTCCAG-3’ (antisense); SNRPB inclusion 5’-ACCCTTGGCCACAG 
AGTATG-3’ (sense) and 5’-CCATGAGACTCCACGAACAA-3’ (antisense); 
SNRPB skipping 5’-CTGGGATGCGAGGGC-3’ (sense) and 5’-AGGAGGGC 
CAAGATGAGTCT-3’ (antisense); BUB3 exon7, 5'-CGCATCACTTGCCT 
TCAGTA-3’ (sense) and 5’-TGTCACTTGGCGAATGAAGA-3’ (antisense); 
BUB3 inclusion #1, 5'-GTCGACGACACTTGCTCAAC-3’ (sense) and 5’-
CAAGTACATGGTGACTTGGGTTT-3’ (antisense); BUB3 skipping #1 5'-GT 
CGACGACACTTGCTCAAC-3’ (sense) and 5’-TGATTAGGTGGACTTGGG 
TTTT-3’ (antisense); BUB3 inclusion #2, 5’-GCAGAAACAAAACCCAAG 
TCA-3’ (sense) and 5’-ATCCACCATTGGGGAGTACG-3’ (antisense); BUB3 
skipping #2 5'-CAGAAACAAAACCCAAGTCCA-3’ (sense) and 5’-GACA 
GAAAACAGAGAAATCCACA-3’ (antisense); FMR1ex9 gene expression 5’-
AGCTTGCCTCGAGATTTCAT-3’ (sense) and 5'-CAATAGCAGTGACCCC
AGGT-3’ (antisense); FMR1ex9 inclusion 5’-CACCATCACCATCGCAAA-3’
(sense) and 5’-TTTCAGCCTCAATCCTCACC-3’ (antisense); FMR1ex9 skipping 
5'-TTCCAAGGAACTTAGTAGGCAAA-3’ (sense) and 5'-TTTCAGCCTCAAT 
CCTCACC-3' (antisense); TRA2B gene expression 5’-GCGTCACATCCGG 
TAGAGTT-3’ (sense) and 5’-CAACCTCTTGCACCTTCCTT-3’ (antisense); 
TRA2B inclusion 5’-AAGGAAAATGCGGAAGTCGT-3’ (sense) and 5’-CAT
TTTCCTTCTTCAACATTAACC-3’ (antisense); TRA2B skipping 5’-AAGGAAG 
GTGCAAGAGGTTG-3’ (sense) and 5’-GGAAGCAGAACGGGATTC-3’ (anti- 
sense); SFRS2 gene expression 5’-GGTCCAGGTCCCGGTCTC-3’ (sense) and 
5'-TTGGATTCCCTCTTGGACAC-3’ (antisense); SFRS2 inclusion 5’-CTTGG 
TTATTTGGCCAGGAA-3’ (sense) and 5’-CACAAAGGCTACCATCAGCA-3’ (anti-
sense); SFRS2 skipping 5’-GGTCCAGGTCCCGGTCTC-3’ (sense) and 5'-TGCT
TGCCGATACATCATTTT-3’ (antisense); GAPDH 5'-GAGTCAACGGATTTT 
GGTCGT-3’ (sense) and 5’-TTGATTTTTGGAAGGGATCTCG-3’ (antisense); 
MRPL19 5'-CAGTTTCTGGGGATTTGCAT-3’ (sense) and 5’-TATTCAGGA 
AGGGCATCTCG-3’ (antisense), GLA 5’-CAGAAATCCGACAGTACTGCAA- 
3’ (sense) and 5’-CATATCTGGGTCATTCCAACC-3’ (antisense). 

For mutagenesis: CPEs, 5’-CCAGGGAAAATATTAATTTGGATATTATA 
ACAACC-3' and 5'-GGTTGTTATAATATCCAAATTAATATTTTCCCTGG- 
3', 5'-TAAATAAACACCTTCTTAAGTGCATGAG-3' and 5'-CCTCTTTT 
CCATTATTTTCAGG-3’; PAS1, 5'-GTGCCATGTTGATGATAAGGAAACA
ATTCGTACTCCCC-3’ and 5’-GGGGAGTACGAATTGTTTCCTTATCATC 
AACATGGCAC-3’; PAS2, 5'-GGTTTTTGAATTTTTTTTTTTAAATGGGCA 


CCTTCTTAAGTGC-3’ and 5’-GCACTTAAGAAGGTGCCCATTTAAAAA 
AAAAAATTCAAAAACC-3'; polypyrimidine tract, 5’-TTGATCTCATGCT 
GTTCTGTGTGTCCCCCGAACTTGTAGGTCACC-3’. 
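
Site-directed mutagenesis primers are normally ordered as exact reverse-complement pairs, so a quick programmatic check can catch transcription errors in a primer list such as the one above. The following is a minimal sketch, assuming plain A/C/G/T sequences; the function and constant names are illustrative and are not part of the original methods. It is applied here to the first CPE mutagenesis pair listed above.

```python
# Minimal sketch: check that two mutagenesis primers are reverse complements.
# Names (reverse_complement, is_complementary_pair) are illustrative only.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_complementary_pair(forward: str, reverse: str) -> bool:
    """True if `reverse` is exactly the reverse complement of `forward`."""
    return reverse_complement(forward) == reverse.upper()


# First CPE mutagenesis primer pair, copied from the oligonucleotide list.
CPE_MUT_FWD = "CCAGGGAAAATATTAATTTGGATATTATAACAACC"
CPE_MUT_REV = "GGTTGTTATAATATCCAAATTAATATTTTCCCTGG"

if __name__ == "__main__":
    print(is_complementary_pair(CPE_MUT_FWD, CPE_MUT_REV))
```

The same check can be run over any other primer pair in the list that is expected to anneal to opposite strands of the same site.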

Plasmid constructions. BUB3 exon7, intron7 and exon8 were subcloned into 
pCMV56™. Each of the five BUB3 3’ UTRs was subcloned downstream of the
Renilla luciferase (pRL-SV40 from Promega). The SV40 poly(A) signal of these
constructs was then removed by HpaI digestion.

Cell cultures, DNA transfections and RNA purification. Exponentially growing 
HD-MyZ cells grown in Roswell Park Memorial Institute medium (RPMI-
1640 + GlutaMAX-I), 10% FCS (fetal calf serum) and 1% penicillin/streptomycin
were transfected at 80% confluence with 0.2 µg of plasmid DNA using Effectene
transfection reagent (Qiagen), following the manufacturer’s instructions. Cells
were incubated 24 h before RNA purification and total RNA was isolated using the 
RNAspin Mini Kit (GE Healthcare). An additional treatment with DNase was 
performed using the Turbo DNA-free Kit (Ambion). 

CPEB1-KD cell line. The CPEB1-KD HD-MyZ cell line was generated as
described previously’. Briefly, HeLa cells or HD-MyZ cells, stably transfected with
a doxycycline-inducible CPEB1 shRNA vector, were induced (CPEB1ShRNA+)
or not (CPEB1ShRNA−) with doxycycline for 4 days for subsequent analysis.
Transient CPEB1 knockdowns were performed by transfecting sh vectors against
the following target sequences: 5’-GCACUUGCUGAAUCUGUCUUU-3’ and 5'-GCAGG
UCUCAUUGGUUUCAUU-3’. Transient UPF1-KD was performed by siRNA
transfection using Lipofectamine 2000 (Invitrogen). The target sequences of the
siRNAs were: 5'-GAGAUAUGCCUGCGGUACA-3’, 5'-GGUCCGGCUCCCCCA
AAUG-3’ and 5'-GAUGCAGUUCCGCUCCAUU-3’.

Immunofluorescence. Cells were plated on poly-lysine pre-coated glass cover- 
slips. 12h later, cells were fixed in PBS/formaldehyde 4% (30 min) and permea- 
bilized at room temperature for 5 min in 0.1% Triton/PBS. They were then blocked 
with 1% BSA for 1 h and incubated with primary antibodies against CPEB1 alone 
or in combination with the SFRS2, washed, and then labelled with the matching 
secondary antibodies. Images were obtained on an inverted Leica TCS SP5 con-
focal microscope with a 63× 1.4 NA PLAN APO objective.

Cell extracts and Western-blot analysis. HD-MyZ or HeLa cells were lysed in 
Triton buffer (20 mM HEPES pH 7.0, 150 mM NaCl, 1% Triton X-100, 10%
glycerol, 1 mM EDTA, 1 mM phenylmethylsulphonyl fluoride, 1× protease
inhibitors (Sigma)). The lysates were sonicated using a Bioruptor (Diagenode) and
centrifuged at 16,000g, and supernatants were resolved by 10% SDS-PAGE. 30 µg
of lysate was loaded onto each lane.

Immunoprecipitation. HeLa nuclear extracts were incubated at 30 °C for 30 min 
in cleavage conditions (see ‘in vitro cleavage assay’ section below) in the presence 
of protease inhibitors. Extracts were precleared, incubated at 37 °C for 30 min with
50 µg ml⁻¹ RNase A and immunoprecipitated with anti-CPEB1 antibody or with
rabbit IgG for 4 h at 4 °C. Immunoprecipitates were washed in cold lysis buffer
(1.5 mM MgCl2, 20 mM HEPES pH 7.5, 150 mM NaCl, 1% NP40, 1 mM EDTA,
10% glycerol, 0.1 mM DTT, 1 mM PMSF, 1× protease inhibitors) and the beads
were resuspended in 1× loading buffer, boiled at 60 °C for 20 min and resolved by
SDS-PAGE together with the inputs. 

RNA immunoprecipitation. HD-MyZ cells were transfected with the WT or 
CPE-mutated minigenes. After 15 h they were washed with PBS, cross-linked with 
1% formaldehyde for 10 min at 30 °C and treated with glycine 1 M. After washing 
twice with PBS, cells were lysed in lysis buffer (1.5 mM MgCl2, 20 mM HEPES pH
7.5, 150 mM NaCl, 1% NP40, 1 mM EDTA, 10% glycerol, 0.1 mM DTT, 1 mM
PMSF, 1× protease inhibitors, 1× RNase inhibitor). Lysates were precleared and
immunoprecipitated with anti-CPEB1 antibody or with rabbit IgG for 4 h at
4 °C. Immunoprecipitates were washed in cold lysis buffer supplemented with
RNase inhibitors. Beads were resuspended in lysis buffer with 0.1% SDS, 3 mg 
of proteinase K and incubated for 30 min at 55 °C and RNA was extracted for RT- 
qPCR analysis. 

3’ RACE and Southern blot analysis. RT-PCR was carried out using the 
SuperScript II reverse transcriptase kit (Invitrogen), following the manufacturer’s 
instructions, starting with 500 ng of DNase-treated total RNA and the 3’ RACE 
primer. 

After PCR using each pair of specific primers, the samples were loaded in a
1.5% agarose gel and run for 2h at 90 V. The gel was then treated with a dena- 
turation solution (1.5 M NaCl, 0.5 M NaOH) for 30 min and then with a neutrali- 
zation solution (1 M Tris, 1.5 M NaCl pH 7.4) for 45 min. After transfer, the DNA 
was cross-linked to the nylon membrane (254 nm; 0.12 J); the membrane was pre-
hybridized with Church buffer for 3 h, hybridized with each ³²P-labelled probe for
12 h and washed three times with 1× SSC, 0.1% SDS.

For Southern-blot ‘APA quantifications’, 3’-SS1 and PAS1 probes detected all 
the 3’-SS1-derived 3’-UTR-cDNAs allowing for direct comparison between PAS1, 
PAS2 and PAS3. For Southern-blot ‘Splicing quantifications’ common exon7 
probes detected all the possible BUB3 3'-UTR variants. For all the variants, except 
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the doublet 3’-SS1/PAS3-SS2/PAS4, quantification can be directly derived from 
exon7 signal. 3'-SS1/PAS3 migrated very close to SS2/PAS4, detected as a doublet 
with exon 7 probe. As Southern-blots were performed by successive re-probing of 
the same membrane with the different splicing and cleavage probes, a comparison 
between lanes, expressed always as ratios for each probe, was made. 

Therefore, the ratio of 3’ SS1/PAS3 to SS2/PAS4 in each doublet was calculated as in
the following example. Let $X$ be the signal of 3’ SS1/PAS3 in WT conditions (either
cells or minigene), $X_1$ the signal of 3’ SS1/PAS3 in mutant conditions (either cells
or minigene), $Y$ the signal of SS2/PAS4 in WT conditions (either cells or minigene)
and $Y_1$ the signal of SS2/PAS4 in mutant conditions (either cells or minigene).
Then, $(X + Y)/(X_1 + Y_1) = 0.91$ (Fig. 3, lanes 1 and 2),
$X/X_1 = 0.38$ (Fig. 3, lanes 25 and 26) and $Y/Y_1 = 1.6$ (Fig. 3, lanes 9 and 10).
Thus, $X/Y = 0.3$ and $X_1/Y_1 = 1.3$. These are the factors from which the ratio of
3’ SS1/PAS3 to SS2/PAS4 can be derived from the signal of the doublet in each lane.

In these equations, there are two groups of errors. One is derived from the origin
of the sample: in the example from Fig. 3a, lanes 1, 25 and 9 are derived from
reprobing the same membrane three times (that is, they are the same sample,
RT-PCR, gel, and so on), and will all have the same error (sample 1 error, $\alpha$)
derived from loading (in the case of the minigene, transfection), RT-PCR, and so
on. The same is true for lanes 2, 26 and 10, which will all have the same error
(sample 2 error, $\beta$). The second source of differential errors is derived from using
three different probes: exon7 (lanes 1, 2), PAS3 (lanes 25, 26) and 3’ SS2 (lanes 9,
10) (probe errors $a$, $b$ and $c$, respectively).

Now, if we include the errors, we can generate the following set of equations with
multiple variables. To determine the relative composition of the doublet band
corresponding to SS1/PAS3 + SS2/PAS4, quantifications from lane 1 have
to be corrected by the error factors $a$ and $\alpha$, whereas the same band quantified
from lane 2 has to be corrected by factors $a$ and $\beta$. Quantification of both
bands produces a ratio where $a\alpha(X + Y)/a\beta(X_1 + Y_1) = 0.91$, and neutralizing
the errors from the probe:

$$\frac{\alpha (X + Y)}{\beta (X_1 + Y_1)} = 0.91$$

For the quantification with the probe PAS3 (lanes 25 and 26), which detects only the
SS1/PAS3, but not the SS2/PAS4, UTR variant, we can derive the following equation:
$b\alpha X/b\beta X_1 = 0.38$, and neutralizing the errors from the probe:

$$\frac{\alpha X}{\beta X_1} = 0.38$$

In the same way, for the quantifications with the 3'-SS2 probe (lanes 9 and 10),
which detects only the 3’ SS2/PAS4, but not the 3’ SS1/PAS3, UTR variant, we can derive
$c\alpha Y/c\beta Y_1 = 1.6$, and then:

$$\frac{\alpha Y}{\beta Y_1} = 1.6$$

Now, rearranging the three equations we get:

$$\alpha X + \alpha Y = 0.91\,\beta X_1 + 0.91\,\beta Y_1$$
$$\beta X_1 = \alpha X / 0.38$$
$$\beta Y_1 = \alpha Y / 1.6$$

Substituting the last two equations in the first:

$$\alpha X + \alpha Y = 0.91\,\alpha X / 0.38 + 0.91\,\alpha Y / 1.6$$

and then $\alpha$ cancels and we are left with $X/Y = 0.3$.
In a similar way, if we substitute instead $\alpha X$ and $\alpha Y$ we obtain $X_1/Y_1 = 1.3$.
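
The substitution above can be checked numerically. The sketch below is illustrative only and is not part of the original analysis: the function name and argument names are assumptions, and the inputs are the three measured lane ratios from the worked example (0.91, 0.38 and 1.6). Because the sample errors cancel, the two closed-form expressions depend only on those three ratios.

```python
# Minimal sketch of the doublet deconvolution described above.
# r_sum = (X + Y)/(X1 + Y1), r_x = X/X1, r_y = Y/Y1, each measured up to
# the same sample-error factor alpha/beta, which cancels in the result.


def doublet_ratios(r_sum: float, r_x: float, r_y: float) -> tuple[float, float]:
    """Return (X/Y, X1/Y1) from the three measured lane ratios."""
    if r_x == 0 or r_y == 0 or r_x == r_sum:
        raise ValueError("ratios do not determine a unique solution")
    # From X + Y = r_sum * (X/r_x + Y/r_y):
    wt_ratio = (r_sum / r_y - 1.0) / (1.0 - r_sum / r_x)
    # From r_x*X1 + r_y*Y1 = r_sum * (X1 + Y1):
    mutant_ratio = (r_sum - r_y) / (r_x - r_sum)
    return wt_ratio, mutant_ratio


if __name__ == "__main__":
    wt, mutant = doublet_ratios(0.91, 0.38, 1.6)
    print(f"X/Y = {wt:.2f}, X1/Y1 = {mutant:.2f}")
```

With the example values, the two results round to the 0.3 and 1.3 quoted in the text.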




As shown, with these normalizations the errors derived both from the two 
different origins of the samples and from the use of three different probes are 
neutralized, demonstrating that the samples can be compared. 

Analysis of the translational control by BUB3 3’ UTRs. 0.2 × 10° HD-MyZ cells
were transfected with 300 ng of each BUB3 3'-UTR-Renilla luciferase, together 
with 30 ng of the control plasmid (Firefly luciferase). After 24h, the RNA was 
extracted for quantification by qPCR. Luciferase activity was detected using the 
Dual-Luciferase Reporter Assay System (Promega), following the manufacturer’s 
instructions. 
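
As an illustration of the normalization used for these reporters (Renilla to Firefly activity, normalized by the corresponding mRNA levels), the following is a minimal sketch. The function name and the example numbers are hypothetical; only the ratio-of-ratios structure is taken from the description of the assay.

```python
# Minimal sketch: Renilla/Firefly luciferase activity normalized to the
# Renilla/Firefly mRNA ratio. All names and example values are hypothetical.


def normalized_rf(renilla_activity: float, firefly_activity: float,
                  renilla_mrna: float, firefly_mrna: float) -> float:
    """Return (R/F activity) divided by (R/F mRNA level)."""
    if firefly_activity <= 0 or renilla_mrna <= 0 or firefly_mrna <= 0:
        raise ValueError("denominators must be positive")
    activity_ratio = renilla_activity / firefly_activity
    mrna_ratio = renilla_mrna / firefly_mrna
    return activity_ratio / mrna_ratio


if __name__ == "__main__":
    # Hypothetical readings for one reporter construct.
    print(normalized_rf(200.0, 100.0, 4.0, 2.0))
```

Dividing by the mRNA ratio separates translational efficiency from differences in transcript abundance between constructs.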

In vitro transcription, ultraviolet cross-linking and immunoprecipitation. 
Transcription templates were generated by PCR from the different plasmids, 
with a T7 promoter included in the upstream primer. Radiolabelled pre-
mRNAs were transcribed for 2 h at 37 °C by using approximately 1 µg of template
DNA in 25 µl transcription reactions containing final concentrations of
40 mM Tris-HCl, pH 7.9, 10 mM NaCl, 6 mM MgCl2, 2 mM spermidine,
0.8 mM dithiothreitol, 0.5 mM ATP, CTP and GTP, 0.05 mM UTP, 30 µCi
[α-³²P]UTP, and 60 units of T7 or 42 units of SP6 RNA polymerases (Promega).
The transcripts were isolated by gel purification followed by phenol/chloroform/
isoamylalcohol (25:24:1) extraction and ethanol precipitation.

³²P-labelled RNAs were incubated under in vitro splicing conditions with HeLa
nuclear extract in a total volume of 27 µl. After incubation for 30 min at 30 °C,
mixtures were irradiated with ultraviolet light (254 nm; 0.4 J). After incubation
with 1 mg ml⁻¹ RNase A at 37 °C for 30 min, U2AF65 was immunoprecipitated
with the MC3 monoclonal antibody hybridoma supernatant. After incubating for
60 min at 4 °C, 30 µl of a 50% slurry of protein A/G sepharose beads was added
and the mixture was incubated under rotation for another 60 min. Beads were
sedimented by centrifugation and washed 4 times with 800 µl of high salt buffer
(500 mM NaCl, 50 mM Tris-HCl pH 8.0, 1% NP-40) and once with the same
buffer with 100 mM NaCl. Sedimented beads were resuspended in 4× SDS load-
ing dye, heated at 95 °C for 5 min, and briefly centrifuged. The supernatant was
then loaded on 10% SDS-polyacrylamide gels. Gels were dried and exposed to a 
PhosphorImager screen. 

In vitro cleavage assay. In vitro cleavage assay was performed as previously 
described’, incubating HeLa nuclear extract with the probes as in Fig. 3d in the 
presence or not of 0.1 µM recombinant CPEB1.

Flow cytometry. DNA content was measured by staining with Hoechst as indi- 
cated and analysed by flow cytometry with an LSR machine (Becton Dickinson). 
Flow cytometry data were analysed with FlowJo software. 

Microarray analysis. Cy3- and Cy5-labelled cRNAs were generated from 500 ng
of total RNA using the Quick-Amp Labelling Kit, No Dye (cat. no. 5190-0447) and
Cy3- and Cy5-CTP from Perkin-Elmer (NEL580 and NEL581). 6 µg of each cRNA
was hybridized in our custom oligo microarray platform (Agilent) in an Agilent 
rotating oven at 60°C for 18h. After hybridization, the arrays were washed, and 
scanned images were analysed. Statistical analyses were carried out with LIMMA 
package (Bioconductor). Microarray results have been deposited in the GEO 
database (GSE32440). 

qPCR. PCR was carried out in a LightCycler 480 (Roche) using SYBRGreen I 
Master (Roche) and the primers indicated in ‘oligonucleotides’ section above. 
Three sets of primers were used for each alternative splicing event analysed; one 
pair monitored gene expression by amplifying a constitutive exon, and the other 
two pairs specifically amplified each single isoform (inclusion versus skipping).

mRNA sequencing. DNase-treated total RNA was fragmented to about 400 nt.
Poly(A)-positive fragments were captured, reverse-transcribed and used to con-
struct the libraries. The samples were then analysed on the Genome Analyzer IIx.
Data are available at the European Nucleotide Archive (ERP001603). 


24. Gebauer, F. et al. Mouse cytoplasmic polyadenylation element binding protein: 
an evolutionary conserved protein that interacts with the cytoplasmic 
polyadenylation elements of c-mos mRNA. Proc. Natl. Acad. Sci. USA 93, 
14602-14607 (1996). 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature11848 


Corrigendum: Global quantification of mammalian gene expression 


control 


Björn Schwanhäusser, Dorothea Busse, Na Li, Gunnar Dittmar, Johannes Schuchhardt, Jana Wolf, Wei Chen & Matthias Selbach


Nature 473, 337-342 (2011); doi:10.1038/nature10098 


Mark Biggin of the Lawrence Berkeley National Laboratory contacted 
us, noting that our mass-spectrometry-based protein copy number 
estimates are lower than several literature-based values. We therefore 
re-analysed the scripts used for data processing, and found a scaling 
error that occurred during the conversion of normalized protein 
intensity values into absolute copy number estimates. As described 
in the original Article, slope and offset for scaling were calculated 
by linear regression based on an in-solution digest with spiked-in 
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Figure 1 | Comparison of LC-MS/MS-based protein copy number estimates 
in NIH3T3 cells with alternative methods. a, Representative western blots of 
cellular proteins with dilution series of purified protein standards. Standards 
were diluted in a way that one-fold corresponds to the amount expected from 
the average of the LC-MS/MS-based estimates. The asterisks indicate the 
position of the GST-fusion proteins. b, Comparison of estimates based on 
western blots (blue, n ≥ 3) and selected reaction monitoring (red, n = 3) with
our LC-MS/MS data (n = 2). Error bars show standard deviations.
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proteins of known concentrations. We erroneously used the slope 
and the offset from an unrelated experiment to scale protein levels, 
resulting in a systematic underestimation of protein levels and derived 
translation rate constants. We apologize for this error and any con- 
fusion it may have caused. 
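
To make the scaling step concrete, the following is a minimal sketch of how a slope and offset obtained by linear regression on spiked-in standards could be applied to convert normalized intensities into copy numbers. It is not the authors' script: the log10-log10 form of the calibration, the function names and the example numbers are all assumptions made for illustration. The point it illustrates is that the slope and offset must come from the calibration run that belongs to the experiment being scaled.

```python
# Minimal sketch (assumed log10-log10 calibration; names are hypothetical).
import math


def fit_line(xs: list[float], ys: list[float]) -> tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + offset."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired calibration points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("calibration x values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    offset = mean_y - slope * mean_x
    return slope, offset


def copies_per_cell(normalized_intensity: float, slope: float, offset: float) -> float:
    """Apply the calibration to one normalized intensity value."""
    return 10 ** (slope * math.log10(normalized_intensity) + offset)


if __name__ == "__main__":
    # Hypothetical calibration: log10 intensity versus log10 known amount.
    slope, offset = fit_line([1.0, 2.0, 3.0], [3.0, 5.0, 7.0])
    print(slope, offset, copies_per_cell(100.0, slope, offset))
```

Applying a slope and offset fitted on an unrelated run shifts every estimate by the same systematic factor, which is the kind of error described above.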

When the error was corrected, the median levels of detected pro- 
teins increased about threefold and the ratio of average protein to 
messenger RNA increased from 900 to 2,800. The median and ap- 
parent maximum translation rate constants increased from 40 to 140 
and from 180 to 1,000 proteins per mRNA per hour, respectively. 
Consequently, the estimated maximum translation rate constant in 
sea urchin embryos at 15 °C (140 proteins per mRNA per hour) is
lower than our corrected prediction for mouse fibroblasts (1,000 
proteins per mRNA per hour). All our conclusions about global gene 
expression control (correlations between mRNA and protein levels 
and half-lives, predominant control of protein abundance at the level 
of translation, functional properties of genes with specific half-life 
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Figure 2 | This figure shows the corrected panels for Figs 2b and d, 3c and d 
and 4b and c of the original Article. We note that although the distribution of 
data in the original and corrected figures appears very similar, the axes are different. 
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combinations and so on) are unaffected. Figure 2 of this Corrigendum 
shows the corrected Figs 2b and d, 3c and d and 4b and c. Supplemen- 
tary Figs 5a and b, 6d and f, 8f, 12a and b, and Supplementary Tables 1 
and 3 of the original Article have been corrected. Protein copy numbers 
and translation rate constants in the text and figures in the HTML and 
PDF versions of the original Article have been corrected. 

To further validate copy numbers in NIH3T3 cells we performed 
western blots with a dilution series of purified human proteins as 
standards (Fig. 1a of this Corrigendum). Briefly, cells were washed, 
harvested by trypsinization, counted independently by two persons, 
lysed in radioimmunoprecipitation assay buffer (containing 1% SDS) 
and separated by SDS-polyacrylamide gel electrophoresis (PAGE). 
As standards, defined amounts of human glutathione-S-transferase 
(GST)-tagged HDAC3, TUBB (Abnova), RHOA, RAC1 or CDC42 
(purified in house and quantified spectrophotometrically) or purified 
ACTB (Biotrend) were diluted in SDS sample buffer containing 
0.07 mg Escherichia coli lysate per microlitre to minimize protein loss 
during dilution. Antibodies against HDAC3 (2632), CDC42 (2466) 
and Rac1/2/3 (2467) were from Cell Signalling; the ACTB (A5441) 


CORRIGENDUM 
doi:10.1038/nature11977 




and TUBB (T8328) antibodies were from Sigma and the anti-RHOA 
antibody (SC-418) was from Santa Cruz. Protein abundance in 
NIH3T3 cells was estimated densitometrically, based on the dilution 
series as a standard curve (Scion Image). We also used selected reaction 
monitoring to quantify two additional proteins (p100 and p105). To 
this end, cells were lysed (6 M urea, 2 M thiourea) and lysates mixed 
with synthetic-stable-isotope-labelled proteotypic peptides (SpikeTides, 
JPT Peptide Technologies). Samples were digested and analysed on a 
Q-Trap 5500 system (AB Sciex) in three technical replicates moni- 
toring three transitions per peptide. Quantification was performed 
using Multiquant 1.2 (AB Sciex) based on the two most intense transi- 
tions. Overall, copy number estimates of the eight proteins obtained 
by alternative approaches correlated well with our data derived from 
liquid chromatography and tandem mass spectrometry (LC-MS/MS) 
(Fig. 1b of this Corrigendum), even though the two measurements 
based on selected reaction monitoring lie above the diagonal. The data 
are in good agreement with the expected precision and reproducibility 
of our large-scale absolute protein quantification approach (see 
Supplementary Figs 6d and 8b of the original Article). 


Corrigendum: Structure-based prediction of protein—protein 


interactions on a genome-wide scale 


Qiangfeng Cliff Zhang, Donald Petrey, Lei Deng, Li Qiang, Yu Shi, Chan Aye Thu, Brygida Bisikirska, Celine Lefebvre, 
Domenico Accili, Tony Hunter, Tom Maniatis, Andrea Califano & Barry Honig 


Nature 490, 556-560 (2012); doi:10.1038/nature11503 


In this Letter, one of the points shown in Fig. 2 and Supplementary 
Figs 8, 9 and Supplementary Table 4 reflects the presence of interactions 
that had been erroneously deposited from a previous publication¹ 
into the IntAct database. We have now used the MINT database to 
retrieve these interactions, and Fig. 2 is corrected here (shown below as 
Fig. 1). The error in IntAct was corrected on 9 November 2012 in consultation 
with the original authors of the paper. We thank S. Michnick 
for bringing this to our attention. We also thank M. Maletta for pointing 
out that Supplementary Fig. 10C was mislabelled and erroneously 
indicated that NKX2-2 protein was not included in the experiment. See 
Supplementary Information to the original paper for corrected versions 
of Supplementary Figs 8–10C and Supplementary Table 4. These 
errors do not affect the results or conclusion of the paper, and have 
been corrected in the HTML and PDF of the original paper. 


1. Tarassov, K. et al. An in vivo map of the yeast protein interactome. Science 320, 
1465–1470 (2008). 

Figure 1 | This is the corrected Fig. 2. 
EDUCATION 


A sense of community 


US scientists passionate about teaching can find rewarding work at community colleges. 


BY NEIL SAVAGE 


( vt Knights used to see his professional 
future laid out before him, and it basically 
involved cancer research. Then a 
colleague told him about a part-time teaching 
position at Northern Virginia Community 
College in Alexandria, not far from Washington DC, 
where Knights was doing a postdoc 
at Georgetown University. He took the job for 
extra money, and ended up falling in love with 
the work. 

Despite the objections of his principal 
investigator and his thesis adviser, Knights 
took a full-time job at the community college 
in 2007. “The ability not just to teach the students 
but to inspire them to share in my love of 
science gave me a new purpose,” he says, recalling 
a student from Ghana who told him what a 
privilege it was to attend college, and a woman 
in her forties who teared up when she first saw 
a cell through a microscope. 

Teaching at a community college may not be 
a career option that occurs to many scientists. 
It certainly wasn’t on Knights’ radar. But it can 
be a rewarding choice for scientists passionate 
about teaching who enjoy working with students 
from a wide variety of educational, cultural 
and socioeconomic backgrounds. 

In the United States, community colleges 
are generally publicly funded institutions 
offering a wide range of educational pro- 
grammes, from remedial work to job train- 
ing or associate's degrees. Some students are 
recent high-school graduates, or those who 
did not finish high school but took a test to 
earn the equivalent of a high-school diploma. 
Others are adults who want retraining in a 
specific field such as nursing, or military 
veterans trying to join the civilian work 
force. Some plan to transfer to a university 
after completing their programme. And some 
are immigrants for whom English is a second 
language. 

Faculty members spend most of their time 
teaching; they rarely run research programmes 
and are not required to publish. Tenure is pos- 
sible: review committees look for effective 
teaching and participation in other aspects 
of the college, such as serving on committees. 
However, the US Department of Education 
found that in 2009 only 12% of community- 
college teachers were tenured, full-time fac- 
ulty members; another 4% were on the tenure 
track. By comparison, 26% of faculty members 
at conventional universities were tenured, with 
12% on the tenure track. 

In the 2011-12 academic year, average 
pay at community colleges ranged from 
US$45,894 for a lecturer to $73,333 for a 
full professor, according to the latest Annual 
Report on the Economic Status of the Profession 
by the American Association of University 
Professors in Washington DC. At universities 
that grant doctorates, the average pay ranged 
from $56,891 for a lecturer to $130,803 for a 
full professor. 


INTO THE CLASSROOM 

Community-college science teachers are in 
demand. The United States has 1,132 commu- 
nity colleges with a total of 13 million students, 
many of whom will take science courses for an 
associate's degree or a certificate in a techni- 
cal field. Others plan to transfer to a bachelor’s 
degree with a science major elsewhere, or are 
working on business or humanities majors 
and need a science course such as astronomy 
or introductory biology to fulfil requirements. 

Part-time adjunct faculty members fill many 
of the teaching slots, allowing college admin- 
istrators to add or subtract course sections 
on the basis of student demand. Nearly 69% 
of community-college faculty members were 
part-time in 2009, the latest year for which 
figures are available, compared with 41% in 
universities, according to the US Department 
of Education. Some adjuncts are graduate stu- 
dents earning extra money or exploring teach- 
ing as a career option. Others are successful 
business people willing to share their expertise. 
And some commute between part-time jobs 
at several schools to effectively create a full- 
time job. 

One benefit of teaching at a community col- 
lege rather than a high school is that the stu- 
dents are adults, who tend to be more focused 
and driven than teenagers. It can also appeal to 
teachers who prefer the shorter academic cal- 
endar of a college. “I feel it gives you a lot of the 
benefits of working in the college environment, 
while at the same time focusing on teaching,” 
says Jeff Schinske, a biology teacher at De Anza 
College in Cupertino, California. 

The required educational qualifications 
vary by state, but in general a community-college 
teacher needs a master’s degree or 
higher. “We like to have PhD-level creden- 
tials,” says Donald Brady, assistant dean of 
science, technology, engineering and math- 
ematics at Middlesex Community College in 
Lowell, Massachusetts. Mary Rittling, presi- 
dent of Davidson County Community Col- 
lege in Thomasville, North Carolina, says that 
faculty members need the same educational 
background as they would to teach under- 
graduates at a university. The qualifications 
of their teachers can affect whether a commu- 
nity-college student can transfer their course 
credits to a university programme. 

Applicants should demonstrate currency 
in their field with, for example, recent gradu- 
ate study or a publication record, says Brady. 
“When we look at applications for full-time 
positions, it’s good to see a CV with what the 
individual has published,” he says. However, he 
adds, the most important criterion is proven 
teaching ability. 

Knights, who sits on his college's hiring com- 
mittee, says that experience as a university 
teaching assistant does not help candidates, 
because it does not generally involve managing 
a class alone. Teachers need to know how to 
develop a curriculum, deal with disruptive 
students, handle different educational 
backgrounds among students and administer 
classroom budgets for supplies, as well as 
understand when to assess students with 
tests and quizzes. Knights recommends 
that people looking for full-time jobs get 
some experience as an adjunct faculty member. 
That will also allow the existing full-time 
faculty members to get to know their 
prospective colleague. 

Unlike primary and secondary schools in 
the United States, community colleges do not 
require their teachers to be certified. However, 
varying levels of educational training are avail- 
able, usually through universities. Northern 
Arizona University in Flagstaff, for example, 
offers a master of arts in science teaching, 
which is open to science graduates who want 
to teach at community colleges. George Mason 
University in Fairfax, Virginia, offers a doctor 
of arts in community-college education. And 
North Carolina State University in Raleigh 
offers a doctorate, a master’s degree and a grad- 
uate certificate in community-college teach- 
ing. The certificate caters mostly to those who 
already have a master’s degree in their field, 
says Chad Hoggan, who runs the programme. 
Participants learn concepts such as differences 
in learning styles and how to work with them, 
and how to teach online courses, which are 
becoming common. Such skills are important, 
says Hoggan, but he does not promise anyone 
that a certificate will land them a job. “It might 
set you apart from an applicant pool of 60 peo- 
ple,” he says. 

Mentoring support can be thin at smaller 
community colleges and novice teachers can 
feel as if they have been thrown in at the deep 
end, says Scott Schultz, chair of the science 
division at Delta College in University Center, 
Michigan. “If you go to a larger school, there's 
often somebody who’s going to take you under 
their wing,” he says. But at small institutions, a 
new faculty member might receive little more 
than a list of class objectives and a copy of the 
textbook used by the previous instructor. “The 
first couple of years, it’s well understood that 
your primary responsibility is learning on the 
job,” says Schultz. 


TEACHER TRAINING 

To help newcomers to cope, Schultz leads 
the Two-Year College New Faculty Train- 
ing Experience, an 18-month programme 
run by the American Association of Phys- 
ics Teachers in College Park, Maryland. The 
experience includes online discussions and 
two weekend conferences: this year, one is at 
Delta, and the other at the University of Min- 
nesota in Minneapolis. The programme covers 
teaching strategies and introduces new teach- 
ers to more-experienced faculty members. 
It is important to build such networks, says 
Schultz. Of the community-college campuses 
that teach physics, 58% have only one physics 
teacher, or only part-time ones, according to a 
survey conducted last year by Susan White, a 
researcher at the American Institute of Physics 
in College Park. 

The population of biology teachers across 
community colleges is much larger, but the job 
can still get lonely, says Schinske. In primary 
and secondary schools, teachers form com- 
munities of peers by grade level. In universi- 
ties, those in the same research area collaborate 
and attend conferences. But such organizing 
forces do not exist at community colleges. So 
Schinske and Kimberly Tanner, who stud- 
ies biology education at San Francisco State 
University in California, used a US National 
Science Foundation grant to launch Commu- 
nity College Biology Faculty Enhancement 
Through Scientific Teaching, a programme 
held at local colleges and San Francisco State 
University that runs monthly workshops and a 
week-long summer institute on teaching strat- 
egies and techniques. It also partners biology 
instructors with their peers to help them to 
become better teachers. 

The difficult economy has actually boosted 
demand for community-college science teach- 
ers, as some students look for educational 
options that are cheaper than universities and 
others want to retrain. A 2009 study by the 
American Association of Community Col- 
leges in Washington DC found that enrol- 
ment in for-credit courses had increased by 
16.9% between autumn 2007 — before the 
economic downturn — and autumn 2009. 
“You see a surge when the economy goes 
down, mainly because we provide access 
and affordability,” says Rittling. “The 
change in the economy has really impacted 
us in science,” agrees Sarah Quast, a professor 
of chemistry at Middlesex Community 
College. “Our science courses, if they 
haven’t doubled, are close to doubling.” 

Although the US national unemploy- 
ment rate remains high — at 7.9% as of 
January this year — there is a need for 
‘middle-skill jobs’ with community-col- 
lege training, says Matthew Meyer, associ- 
ate vice-president for science, technology, 
engineering and mathematics innovations 
in the North Carolina Community College 
system, based in Raleigh. Nurses, for exam- 
ple, are always in demand, and many get 
their training at community colleges, where 
they take courses such as anatomy, physi- 
ology and microbiology. Dental hygienists, 
radiology technicians and lab technicians 
all take biology, engineering or physics 
courses. Meyer also sees a growing need 
for employees with training in aeronautics, 
nanotechnology, advanced manufacturing 
and life sciences such as pharmaceuticals. 

Those trends will mean more openings 
for science teachers, as will demographic 
changes. “Our faculty on average are in 
their upper forties or fifties,” says Meyer. 
“We're going to have to be able to replace 
them.” 

Although teaching at a community col- 
lege is not research-oriented, scientists can 
still use their training. Schultz says that it 
is not uncommon for community-college 
instructors to publish in educational- 
research journals. “Although we are not 
doing research on subatomic particles, we 
are constantly collecting data and doing 
research on how to improve learning in our 
classroom,” he says. 

Teaching appeals to the researcher in 
Schinske. Assessing his class involves gath- 
ering data from students about what they 
know at the beginning of the course, get- 
ting feedback on what concepts they under- 
stand and which ones they struggle with, 
and measuring how they have changed by 
the end. In some ways, he says, the expe- 
rience is not unlike his graduate work in 
monitoring fish populations to study their 
evolution. “There are times when I view my 
students very similarly to my study organisms,” 
he says, “and I mean that in only the 
most respectful way.” 


Neil Savage is a freelance writer based in 
Lowell, Massachusetts. 


TURNING POINT 


Jörg Wiedenmann 


Jörg Wiedenmann, head of the Coral Reef 
Laboratory at the UK National Oceanography 
Centre, Southampton, won a €1.29-million 
(US$1.7-million) European Research Council 
(ERC) Starting Grant in December. He will 
use the money to investigate how nutrient 
starvation influences coral bleaching, in which 
the symbiotic microbes that give corals their 
colour are damaged. 


Did you have an aquarium as a child? 

Yes. My grandmother took me to the local 
aquarium, where I became fascinated with cor- 
als and saltwater invertebrates. Eventually I got 
my own seawater aquarium. This childhood 
hobby developed into an important skill set: I 
am one of the few people in the field who can 
maintain long-term coral collections and use 
them as experimental models in an aquarium. 


Describe your first research finding. 

It was for my senior thesis at the University of 
Ulm in Germany. I came across anemones in 
the Mediterranean Sea that had red fluorescent 
proteins, even though they were non-lumines- 
cent. I realized that, as an ecologist, I would 
need to gain skills in biochemistry, genetics 
and structural biology to study these proteins. 


How did that set your research direction? 

I was lucky. I worked with open-minded bio- 
medical researchers who taught me techniques 
that opened up a new horizon — exploring the 
use of fluorescent proteins in biomedical appli- 
cations. It was very exciting, collaborating with 
experts in different disciplines. 


You have moved through your career quickly. 
What motivated you? 

I finished my PhD in three and a half years when 
colleagues were taking four to five. I realized 
that moving on would help me to further my 
career. After my PhD, the head of the depart- 
ment offered me a group-leader position at 
Ulm. One is expected to move around and gain 
experience, but having worked in an interdisci- 
plinary environment gave me the confidence to 
lead a group. In 2005, I got my habilitation, the 
prerequisite to become a professor in Germany; 
I was three and a half years younger than aver- 
age. Trying to complete tasks in a short time was 
probably an important decision in my career. 


Do you think skipping a postdoc helped you to 
get established? 

I do. It is instructive to have early independ- 
ence, but it can be hard. I needed to get my own 
funding: I could not rely on a senior figure with 
a grant. As a result, I applied for, and got, many 
prestigious fellowships. I think demonstrating 
independence helped my ERC application. 


How has moving to Southampton helped you? 

I needed a change. The move allowed me to 
explore questions about coral biology — such 
as what drives stress responses. In my first 
years there, I got a number of small grants, but 
after three and a half years, my lab needed a 
big proposal to continue to do good science. 


How did you put together your ERC proposal? 

I was convinced that I had a very exciting idea 
that would fit perfectly into the ERC funding 
scope. My collaborators and I collected data to 
convince a sceptical scientific community that 
we could combine lab experiments with field- 
based approaches to get useful answers for 
coral-reef management (J. Wiedenmann et al. 
Nature Clim. Change 3, 160-164; 2013). 


What did it feel like to get the grant? 

It is a game changer. It gives me the freedom 
to do science and produce papers that would 
otherwise be delayed. I am happy to invest this 
time in science instead of another proposal. 


What part does outreach play in your job? 

We have a strong outreach culture at the 
National Oceanography Centre. I built a coral 
aquarium in the reception area, to educate 
visitors and students. Researchers can use it to 
explain how the coral changes with light level. 
Pressing a button changes the illumination of 
the tank from white to blue to bring out the 
fluorescence of the corals, which gives us the 
opportunity to explain the biomedical applications 
for those proteins. 


INTERVIEW BY VIRGINIA GEWIN 
SCIENCE FICTION 


CANOPY OF SKULLS 


BY BETH CATO 


Medea trod the patchwork shadow 
and light beneath the canopy of 
skulls. Set high on poles, the white 
bones had turned varying shades of grey and 
green. Some wore full skins of verdant flora. 
These were the colony elders, the first gen- 
eration on Rikyu. Almost 50 skulls, their 
soulless eye sockets staring at the stars from 
whence they had come. 

As a child, Medea had yearned for her 
own skull to belong among the canopy. She 
had come to accept that she was generations 
too late for such an honour, and now she no 
longer belonged on Rikyu at all. 

Within hours, a shuttle would deliver the 
11 surviving colonists to an orbiter. Medea 
was the youngest at 15, and she itched with 
restlessness. She had no desire to stay, not 
since Mama had died the previous winter. 
And yet, she ached with a need to pay hom- 
age to the only home she knew, to these 
skulls she had adored since childhood. 

“Give me a sign,” she whispered to her 
great-great-grandparents above. She was 
even wearing her furs one final time, so they’d 
recognize her. “Show me how to honour you.” 

Birdlings and slither-slips rustled among 
the ferns. Gnarled vines draped from pole 
to pole brushed her cheeks. She knew each 
elder’s name as a mantra: Kazuo Uematsu, 
Mary Brown, Jorge Jimenez and down the 
line. A few poles were empty, their bones 
swallowed by the forest. 

Her bare feet, callused to stiff leather, paced 
the familiar furrows below the canopy. Some- 
thing smooth slid beneath her toe. She paused 
and picked up a white rock. Odd, to find such 
a rounded stone so far from the river. 

“Forty-nine skulls. Your precious elders. A 
century on this rock, and these are the most 
valuable things here. Pathetic.” 

Medea whirled around. It was Mr Dale, 
the researcher. He had come down on the 
shuttle and intended to stay and study the 
remains of their colony. Her brow furrowed 
as she struggled to understand him. They 
both spoke English, but the ship’s crew used 
strange words and formed their tongues in 
peculiar ways. 

“Yes. Forty-nine,” she said with a smile, 
pleased at his interest in the canopy. 

He stopped about ten feet away. His pale 
skin gleamed like sickness, though she knew 
it was from a lack of natural light. Strange, to 
think of her own skin turning so pale while 
travelling through space. 
When two worlds collide. 


“This is my favourite place,” Medea said. 
“I will miss it most.” She blinked back tears. 

He tilted an ear as he tried to decipher her 
words, then shook his head. 

“You're not even worth the effort of turning 
on my translator. Well, you'll be gone soon 
enough, and those lucky bastards on the ship 
get to work on civilizing you.” He snorted. 
“Though it’d be funny to see you lot learning 
about hot showers and ready food, all that. 
No more hunting, not unless you're going to 
pelt rocks at a refrigerator.” He pointed at her. 
“Where... are... your... clothes?” 

His words and accent tumbled together like 
mating coati, but the last bit was indeed easier 
to understand. “At my house. I needed to wear 
my furs to come here and say good-bye.” 

Medea did love how her new clothes fit 
— soft as otterette skin, though strange in 
how they covered her from neck to ankle. 


The processed protein nuggets they expected 
her to eat were not as welcome. 

She rubbed the smooth stone in the well of her palm and 
wondered what the elders wanted of her. 

“Ah, little barbarian. It’s quite understanda- 
ble when alien races engage in such practices, 
but for humans to degrade to ancient meat- 
eating-and-wearing days? That's repulsive.” 
He sighed and jerked his hand overhead at 
the skull of Aamina Mukherjee. “And no con- 
cept of money, no concept of worth. This lot 
crashed on this uncharted, un-terra-formed 
slag heap, and you may as well have landed 
in the Mesozoic.” 

Medea couldn’t help but perk up. “That’s 
Aamina! She was a botanist. She cultivated 
the vines we use for thatching and snares, 
and genetically tweaked them to reduce skin 
allergies. Isn't she beautiful with her draping 
flowers? You're lucky to see her in bloom.” 

Homesickness knotted in her gut, even 
though she had yet to leave home. She was 
so excited to see other worlds, and yet — she 
envied this man for staying here. She thought 
on something Mr Dale said — the ship’s 
crew being “lucky bastards”. They thought 
as highly of him. She’d heard the captain say 
that Mr Dale was a “crooked bastard” who 
deserved to stay on a planet like Rikyu. 

“Look at you,” said Mr Dale, laughing. 
“You're daft. You'll probably end up whor- 
ing on Janero. You have no idea that some- 
thing like this — this bone on a stick — will 
sell for a million genny-slips because of the 
story and tragedy of this place. You're a new- 
fangled Donner Party. I'll make my fortune 
here.” He gripped a vacant pole, grinning. 

Here. The depth of meaning in that one 
word resonated. Sunlight sliced through the 
opening in the canopy and illuminated his 
bare head. Birdlings hushed. Medea smiled 
acknowledgment to the skulls and to Mr 
Dale's fervent request. 

The rock flew from her hand. His skull 
cracked, sound reverberating through the 
canopy. Mr Dale tumbled backwards with a 
crackle of underbrush. 

“You are a first generation to come here! If 
I could stay with the elders, I would. You are 
lucky to belong here!” 

Tears filled her eyes as she unsheathed her 
knife. She had just enough time to set him on 
a pole. Medea gazed up at the heavens, like 
the skulls around her, and watched for the 
glint of the orbiter above. 


Beth Cato resides in Arizona. Her work can 
be found in Daily Science Fiction and Flash 
Fiction Online. For information on her latest 
projects, please visit www.bethcato.com. 